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0.1. PREFACE 


IX 


0.1 Preface 

In this book I present differential geometry and related mathematical topics with 
the help of examples from physics. It is well known that there is something 
strikingly mathematical about the physical universe as it is conceived of in 
the physical sciences. The convergence of physics with mathematics, especially 
differential geometry, topology and global analysis is even more pronounced in 
the newer quantum theories such as gauge field theory and string theory. The 
amount of mathematical sophistication required for a good understanding of 
modern physics is astounding. On the other hand, the philosophy of this book 
is that mathematics itself is illuminated by physics and physical thinking. 

The ideal of a truth that transcends all interpretation is perhaps unattain- 
able. Even the two most impressively objective realities, the physical and the 
mathematical, are still only approachable through, and are ultimately insepa- 
rable from, our normative and linguistic background. And yet it is exactly the 
tendency of these two sciences to point beyond themselves to something tran- 
scendentally real that so inspires us. Whenever we interpret something real, 
whether physical or mathematical, there will be those aspects which arise as 
mere artifacts of our current descriptive scheme and those aspects that seem to 
be objective realities which are revealed equally well through any of a multitude 
of equivalent descriptive schemes- “cognitive inertial frames” as it were. This 
theme is played out even within geometry itself where a viewpoint or interpre- 
tive scheme translates to the notion of a coordinate system on a differentiable 
manifold. 

A physicist has no trouble believing that a vector field is something beyond 
its representation in any particular coordinate system since the vector field it- 
self is something physical. It is the way that the various coordinate descriptions 
relate to each other (covariance) that manifests to the understanding the pres- 
ence of an invariant physical reality. This seems to be very much how human 
perception works and it is interesting that the language of tensors has shown up 
in the cognitive science literature. On the other hand, there is a similar idea as 
to what should count as a geometric reality. According to Felix Klein the task 
of geometry is 

“given a manifold and a group of transformations of the manifold, 
to study the manifold configurations with respect to those features 
which are not altered by the transformations of the group” 

-Felix Klein 1893 

The geometric is then that which is invariant under the action of the group. 
As a simple example we may consider the set of points on a plane. We may 
impose one of an infinite number of rectangular coordinate systems on the plane. 
If, in one such coordinate system (x,y), two points P and Q have coordinates 
(x(P),y(P)) and (x(Q),y(Q)) respectively, then while the differences Ax = 
x(P) — x(Q) and Ay = y(P) — y(Q) are very much dependent on the choice of 
these rectangular coordinates, the quantity (Ax) 2 + (Ay) 2 is not so dependent. 
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If (X, Y) are any other set of rectangular coordinates then we have (Aa ;) 2 + 
(Ay ) 2 = (AA ) 2 + (AF) 2 . Thus we have the intuition that there is something 
more real about that later quantity. Similarly, there exists distinguished systems 
for assigning three spatial coordinates ( x , y, z) and a single temporal coordinate 
t to any simple event in the physical world as conceived of in relativity theory. 
These are called inertial coordinate systems. Now according to special relativity 
the invariant relational quantity that exists between any two events is (Aa :) 2 + 
(Ay ) 2 + (A z ) 2 — (At) 2 . We see that there is a similarity between the physical 
notion of the objective event and the abstract notion of geometric point. And 
yet the minus sign presents some conceptual challenges. 

While the invariance under a group action approach to geometry is powerful 
it is becoming clear to many researchers that the looser notions of groupoid and 
pseudogroup has a significant role to play. 

Since physical thinking and geometric thinking are so similar, and even at 
times identical, it should not seem strange that we not only understand the 
physical through mathematical thinking but conversely we gain better mathe- 
matical understanding by a kind of physical thinking. Seeing differential geom- 
etry applied to physics actually helps one understand geometric mathematics 
better. Physics even inspires purely mathematical questions for research. An 
example of this is the various mathematical topics that center around the no- 
tion of quantization. There are interesting mathematical questions that arise 
when one starts thinking about the connections between a quantum system and 
its classical analogue. In some sense, the study of the Laplace operator on a 
differentiable manifold and its spectrum is a “quantized version” of the study of 
the geodesic flow and the whole Riemannian apparatus; curvature, volume, and 
so forth. This is not the definitive interpretation of what a quantized geometry 
should be and there are many areas of mathematical research that seem to be 
related to the physical notions of quantum verses classical. It comes as a sur- 
prise to some that the uncertainty principle is a completely mathematical notion 
within the purview of harmonic analysis. Given a specific context in harmonic 
analysis or spectral theory, one may actually prove the uncertainty principle. 
Physical intuition may help even if one is studying a “toy physical system” that 
doesn’t exist in nature or only exists as an approximation (e.g. a nonrelativistic 
quantum mechanical system). At the very least, physical thinking inspires good 
mathematics. 

I have purposely allowed some redundancy to occur in the presentation be- 
cause I believe that important ideas should be repeated. 

Finally we mention that for those readers who have not seen any physics 
for a while we put a short and extremely incomplete overview of physics in 
an appendix. The only purpose of this appendix is to provide a sort of warm 
up which might serve to jog the readers memory of a few forgotten bits of 
undergraduate level physics. 



Chapter 1 

Preliminaries and Local 
Theory 


I never could make out what those damn dots meant. 
Lord Randolph Churchill 


Differential geometry is one of the subjects where notation is a continual 
problem. Notation that is highly precise from the vantage point of set theory and 
logic tends to be fairly opaque with respect to the underlying geometric intent. 
On the other hand, notation that is uncluttered and handy for calculations tends 
to suffer from ambiguities when looked at under the microscope as it were. It 
is perhaps worth pointing out that the kind of ambiguities we are talking about 
are accepted by every calculus student without much thought. For instance, 
we find ( x,y,z ) being used to refer variously to “indeterminates” , “a triple of 
numbers” , or functions of some variable as when we write 


x{t) = (x(t),y(t),z(t)). 


Also, we often write y = f(x) and then even write y = y{ x) and y'(x) or dy/dx 
which have apparent ambiguities. This does not mean that this notation is bad. 
In fact, it can be quite useful to use slightly ambiguous notation. In fact, human 
beings are generally very good at handling ambiguity and it is only when the self 
conscious desire to avoid logical inconsistency is given priority over everything 
else do we begin to have problems. The reader should be warned that while 
we will develop fairly pedantic notation we shall also not hesitate to resort to 
abbreviation and notational shortcuts as the need arises (and with increasing 
frequency in later chapters). 

The following is short list of notational conventions: 
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Category 

Sets 

Elements 

Maps 

Vector Spaces 

V , W 

v,w, x,y 

A, B, A, L 

Topological vector spaces (TVS) 

E,V,W,R" 

v, w,x,y 

A, B, A, L 

Open sets in TVS 

u,v,o,u a 

p,x,y,v,w 


Lie Groups 

G, H, K 

g,h,x,y 

h, /, g 

The real (resp. complex) numbers 

R, (resp.C) 

t,s,x,y,z, 

f,g,h 

One of R or C 

F 

H 



more complete chart may be found at the end of the book. 

The reader is reminded that for two sets A and B the Cartesian product 
Ax B is the set of pairs A x B := {(a, b) : a € A, b £ B}. More generally, 
lit ; = {( a i) : a i € Af). 

Notation 1.1 Here and throughout the book the symbol combination “:= ” 
means “equal by definition’. 


In particular, R" := 1 x • • • x R the product of -copies of the real numbers 
R. Whenever we represent a linear transformation by a matrix, then the matrix 
acts on column vectors from the left. This means that in this context elements 
of R” are thought of as column vectors. It is sometimes convenient to repre- 
sent elements of the dual space (R”)* as row vectors so that if a € (R™)* is 
represented by (ai, ...,a„) and v € R n is represented by (v 1 , ■■■,v n ) then 


y(v) = (ax a„) 


Since we do not want to have to write the transpose symbol in every instance 
and for various other reasons we will sometimes use upper indices (superscripts) 
to label the component entries of elements of R” and lower indices (subscripts) 
to label the component entries of elements of (R")*. Thus (v 1 1 ...,v n ) invites 
one to think of a column vector ( even when the transpose symbol is not present) 
while (a±, ...,a n ) is a row vector. On the other hand, a list of elements of a 
vector space such as a basis will be labelled using subscripts while superscripts 
label lists of elements of the dual space of the initially introduced space. 


1.1 Calculus 

Let V be a finite dimensional vector space over a field F where F is the real num- 
bers R or the complex numbers C. Occasionally the quaternion number algebra 
H (a skew field) will be considered. Each of these spaces has a conjugation map 
which we take to be the identity map for R while for C and H we have 

x + yi e- > x — yi and 
x + yi + zj + tck x — yi ^ zj — wk 
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respectively. The vector spaces that we are going to be dealing with will serve as 
local models for the global theory. For the most part the vector spaces that serve 
as models will be isomorphic to F” which is the set of n — tuples (ad, ...,x n ) of 
elements of F. However, we shall take a slightly unorthodox step of introducing 
notation that exibits the variety of guises that the space F” may appear in. 
The point is that althought these spaces are isomorphic to F" some might have 
different interpretations as say matrices, bilinear forms, tensors, and so on. We 
have the following spaces: 


1. F" which is the set of ?i-tuples of elements of F which we choose to think 
of as column vectors. These are written as (ad, ..., x n ) ( or more commonly 
simply as (ad, ..., x n ) where the fact that we have place the indices as 
superscripts is enough to remind us that in matrix multiplication these 
are supposed to be columns. The standard basis for this vector space 
is {e.;}i<j< n where e,; has all zero entries except for a single 1 in the i- 
th position. The indices have been lowered on purpose to facilitate the 
expression like v = ]Tb tAe*. The appearence of a repeated index one of 
which is a subscript while the other a superscript, signals a summation. 
Acording to the Einstien summatation convention we may omit the ]Tb 
and write v = v l &i the summation being implied. 

2. F„ is the set n-tuples of elements of F thought of as row vectors. The 
elements are written (£i,...,£ n ) • This space is often identified with the 
dual of F" where the pairing becomes matrix multiplication (£, v) = £w. 
Of course, we may also take F" to be its own dual but then we must write 
(v, w) = v*w. 

3. F(( t is just the set of m x n matrices, also written M mxn . The elements are 
written as (a ;*•). The standard basis is {e^ } so that A = (A*) = JT ■ Aje- 

4. F m>n is also the set of m x n matrices but the elements are written as (x t j) 
and these are thought of as giving maps from F” to F„ as in ( v l ) i— > (u>, ; ) 
where w, : = y~h XjjV 3 . A particularly interesting example is when the 
F = K. Then if ( g ^ ) is a symmetic positive definite matrix since then we 
get an isomorphism g : F™ = F„ given by Vi = JT gijV 3 . This provides us 
with an inner product on F„ given by y~h g i jW' t v J and the usual choice for 
gij is Sij = 1 if i = j and 0 otherwise. Using 6ij makes the standard basis 
on F” an orthonormal basis. 

5. Fj is the set of all elements of F indexed as Xj where / el and I £ J 
for some indexing sets X and J . The dimension of F X j is the cardinality 
of X xj. To look ahead a bit, this last notation comes in handy since it 
allows us to reduce a monster like 


= E 

k\ ,fc2, 




Lk 1 k 2 ...k„ 
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to something like 

c 7 = a K bR 

K 

which is more of a “cookie monster” . 

In every case of interest V has a natural topology that is de- 
termined by a norm. For example the space F x (:= ) has an 

inner product (vi,V 2 ) = Ylie where Vi = 0,1 e i and 

V 2 = y, fcT b 1 e/. The inner product gives the norm in the usual way 
|v| := (v, v) 1 / 2 which determines a topology on V. Under this topol- 
ogy all the vector space operations are continuous. Futhermore, all 
norms give the same topology on a finite dimensional vector space. 


§§ Interlude §§ 

Infinite dimensions. 

What about infinite dimensional spaces? Are there any “standard” spaces 
in the infinite dimensional case? Well, there are a few problems that must 
be addresses if one want to include infinite dimensional spaces. We will not 
systematically treat infinite dimensioanl manifold theory but calculas on infinite 
dimensional spaces can be fairly nice if one restricts to complete normed spaces 
(Banach spaces). As a sort of warm up let us step through a progressively 
ambitious attemp to generalize the above spaces to infinite dimensions. 

1. We could just base our generalization on the observation that an element 
(x l ) of F” isreally just a function x : {1, 2, ..., n} — > F so maybe we should 
consider instead the index set N = {1,2,3,... — > oo}. This is fine except 
that we must interpret the sums like v = v l ei. The reader will no doubt 
realize that one possible solution is to restrict to oo-tuple (sequences) that 
are in I 2 . This the the Hilbert space of square summable sequences. This 
one works out very nicely although there are some things to be concerned 
about. We could then also consider spaces of matrices with the rows and 
columns infinite but square summable these provide operetors I 2 — > £ 2 . 
But should we restrict to trace class operators? Eventually we get to 
tensors where which would have to be indexed “tuples” like (T{? fc ) which 
are square summable in the sense that ^(Y{? fc ) 2 < oo. 

2. Maybe we could just replace the indexing sets by subsets of the plane 
or even some nice measure space fi. Then our elements would just be 
functions and imediately we see that we will need measurable functions. 
We must also find a topology that will be suitable for defining the limits 
we will need when we define the derivative below. The first possiblity is 
to restrict to the square integrable functions. In other words, we could 
try to do everything with the Hilbert space L 2 (H). Now what should the 
standard basis be? OK, now we are starting to get in trouble it seems. 
But do we really need a standard basis? 
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3. It turns out that all one needs to do calculus on the space is for it to be a 
sufficiently nice topological vector space. The common choice of a Banach 
space is so that the proof of the inverse function theorem goes through 
(but see [KM]). 


The reader is encouraged to look at the appendices for a more 
formal treatment of infinite dimensional vector spaces. 


Next we record those parts of calculus that will be most important to our 
study of differentiable manifolds. In this development of calculus the vector 
spaces are of one of the finite dimensional examples given above and we shall 
refer to them generically as Euclidean spaces. For convenience we will restrict 
our attention to vector spaces over the real numbers R. Each of these “Eu- 
clidean” vector spaces has a norm where the norm of x denoted by ||x||. On the 
other hand, with only minor changes in the proofs, everything works for Banach 
spaces. In fact, we have put the proofs in an appendix where the spaces are 
indeed taken to be general Banach spaces. 

Definition 1.1 Let V and W be Euclidean vector spaces as above (for example 
R™ and R 1 ™,). Let U an open subset of V. A map f : U — > W is said to be 
differentiable at x GU if and only if there is a (necessarily unique) linear map 
Df\ p : V — » W such that 

/(x + v) - /(x) - Df\ p M 
Inn — n 

M->° IMI 

Notation 1.2 We will denote the set of all linear maps from V to W by L(V, W). 
The set of all linear isomorphisms from V onto W will be denoted by GL(V, W). 
In case, V = W the corresponding spaces will be denoted by fll(V) and GL(W). 
For linear maps T : V — > W we sometimes write T ■ v instead ofT(y) depending 
on the notational needs of the moment. In fact, a particularly useful notational 
device is the following: Suppose we have map A : X — » L(V; W). Then we would 
write A(x)- v or A\ x v. 

Here GL(V) is a group under composition and is called the general linear 
group . In particular, GL(V, W) is a subset of L(V, W) but not a linear subspace. 

Definition 1.2 Let V,,, i = 1 and\N be finite dimensional ¥ -vector spaces. 
A map p :Vi x ■ ■ ■ x — ► W is called multilinear (k-multilinear) if for each 
i, 1 < i < k and each fixed (w lt ...,w ..., w *.) gV) x ••• x Vj x x\4 we have 
that the map 

i-th 
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obtained by fixing all bat. the i-th variable is a linear map. In other words, we 
require that p be F- linear in each slot separately. 

The set of all multilinear maps Vi x • • • x Vj, — > W will be denoted by 
L( V l5 ..., V ; W ) . If Vx = • • • = Vfc = V then we write L k ( V;W) instead of 
L( V, V; W) 

Since each vector space has a (usually obvious) inner product then we have 
the group of linear isometries 0( V) from V onto itself. That is, 0( V) consists 
of the bijective linear maps d> : V— > V such that ($v, $w) = (v, w) for all 

v, w € V. The group ()( V) is called the orthogonal group. 

Definition 1.3 A (bounded) multilinear map p : V x ■■■ x \/ —> \N is called 
symmetric (resp. skew- symmetric or alternating) iff for any vi, V 2 , v*, £ 
V we have that 


M v l) v 2j — ) v fc) = K(v a 1, V^-2, —Nak) 

resp. v 2 , v fc ) = S 5 n(cr)/i(v (7 i, v^, v CTfc ) 

for all permutations a on the letters {1,2 TTie set of all bounded sym- 
metric (resp. skew-symmetric) multilinear maps V x • • • x V — > W is denoted 
L k sym (V; W) {resp. Lj fcew (V;W) orL* t (V;W)). 

Now the space L(V, W) is a normed space with the norm 

||/|| = sup = sup{||/(v)|| w : ||v|| v = 1}. 

vev II v|| v 

The spaces L(V i, .... V^; W) also have norms given by 

IMI : = sup{||/x(v 1 ,v 2 ,...,v fc )|| w : 1 1 v j 1 1 v . = 1 for i = 1 ,..,k} 

Notation 1.3 In the context of K n , we often use the so called “multiindex 
notation”. Let a = (au, ...,a n ) where the ai are integers and 0 < cti < n. Such 
an n-tuple is called a midtiindex. Let |a| := a\ + ... + a n and 

d a f _ 

~dxC d(x 1 ) a ^d(x 1 ) a ^ ■ ■ ■ d(ad) a ™‘ 

Proposition 1.1 There is a natural linear isomorphism L(V, L(V, W)) = L 2 (V, W) 
given by 

l(v i)(u 2 ) < — * l(v i,v 2 ) 

and we identify the two spaces. In fact, L(V, L(V, L(V, W)) = L 3 (V; W) and in 
general L(V, L(V, L(V, ..., L(V, W)) S L fc (V; W) etc. 

Proof. It is easily checked that if we just define (t T)(ri)(r 2 ) = T(u l5 r 2 ) 
then r T <-> T does the job for the k = 2 case. The k > 2 case can be done by 
an inductive construction and is left as an exercise. It is also not hard to show 
that the isomorphism norm preserving. ■ 
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Definition 1.4 If it happens that a function f is differentiable for all p through- 
out some open set U then we say that f is differentiable on U. We then have 
a map Df : U C V — >L(V, W) given by p i— > Df( p). If this map is differen- 
tiable at some p € V then its derivative at p is denoted DDf(p) = D 2 f( p) or 
D 2 f | and is an element of L(V , L(V , W)) = L 2 (V;W). Similarly, we may in- 
ductively define D k f £ L fc (V; W) whenever f is sufficiently nice that the process 
can continue. 

Definition 1.5 We say that a map f : U C V — » W is C r — differentiable on 
U if D r f | p £ L r (V,W) exists for all p £ U and if D r f is continuous as map 
U — > L r (V,W). If f is C r — differentiable on U for all r > 0 then we say that f 
is C°° or smooth (on U). 

Definition 1.6 A bisection f between open sets U a C V and Up C W is called 
a C r — diffeomorphism iff f and f~ x are both C r — differentiable (on U a and 
Up respectively). If r — oo then we simply call f a diffeomorphism. Often, we 
will have W = V in this situation. 

Let U be open in V. A map f : U — » W is called a local C r diffeomorphism 
iff for every p £ U there is an open set U p C U with p £ U p such that f\ u : 
U p — > /(C/ p ) is a C r — diffeomorphism. 

In the context of undergraduate calculus courses we are used to thinking 
of the derivative of a function at some a £ R. as a number f'(a ) which is the 
slope of the tangent line on the graph at (a, f(a)). From the current point of 
view Df(a) = Df | just gives the linear transformation h i— > /'(a) • h and the 
equation of the tangent line is given by y = f(a) + f\a)(x — a). This generalizes 
to an arbitrary differentiable map as y = /(a) + Df( a) • (x — a) giving a map 
which is the linear approximation of / at a. 

We will sometimes think of the derivative of a curve 1 c : / C R — > E at to G I, 
written c(t 0 )> as a velocity vector and so we are identifying c(t 0 ) £ L(R, E) with 
Dc | t • 1 £ E. Here the number 1 is playing the role of the unit vector in R. 

Let / : U C E — » F be a map and suppose that we have a splitting 
E= E 1 XE 2 X ■ ■ • E n for example . We will write /(x 1 , ..., x") for (x, ...,x") £ 
E 1 xE 2 x • • • E n . Now for every a =(a 1 , a") £ EjX • • • x E„ we have the partial 
map / a ,; : y 1 — > /(a 1 , ..., y, ...a”) where the variable y is is in the i slot. This de- 
fined in some neighborhood of a* in E t . We define the partial derivatives when 
they exist by Z//( a) = Df 3 p(a l ). These are, of course, linear maps. 

A/(a) : Ej — » F 

The partial derivative can exist even in cases where / might not be differentiable 
in the sense we have defined. The point is that / might be differentiable only 
in certain directions. 


1 We will often use the letter I to denote a generic (usually open) interval in the real line. 
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If / has continuous partial derivatives Dif(x) : Ej — > F near x £ E = Ei x Eo x • • 
then Df(x) exists and is continuous near p. In this case, 


Df{x) • v 

n 

= ^2 D if(x,y) ■ v l 

i = 1 


where v = (v , v"). 


§ Interlude § 

Thinking about derivatives in infinite dimensions 

The theory of differentiable manifolds is really just an extension of calculus 
in a setting where, for topological reasons, we must use several coordinates sys- 
tems. At any rate, once the coordinate systems are in place many endeavors 
reduce to advanced calculus type calculations. This is one reason that we re- 
view calculus here. However, there is another reason. Namely, we would like to 
introduce calculus on Banach spaces. This will allow us to give a good formula- 
tion of the variational calculus that shows up in the study of finite dimensional 
manifolds (the usual case). The idea is that the set of all maps of a certain type 
between finite dimensional manifolds often turns out to be an infinite dimen- 
sional manifold. We use the calculus on Banach spaces idea to define infinite 
dimensional differentiable manifolds which look locally like Banach spaces. All 
this will be explained in detail later. 

As a sort of conceptual warm up, let us try to acquire a certain flexibility 
in the way we think about vectors. A vector as it is understood in some con- 
texts is just an n— tuple of numbers which we picture either as a point in R" 
or an arrow emanating from some such point but an n— tuple (x 1 , ...,x n ) is also 
a function x : i i— > x(i) = x l whose domain is the finite set {l,2,...,n}. But 
then why not allow the index set to be infinite, even uncountable? In doing 
so we replace the n— tuple ( x l ) be the “continuous” tuple /(x). We are used 
to the idea that something like x l y l should be replaced by an integral 

/ f( x )g( x )dx when moving to these continuous tuple (functions). Another ex- 
ample is the replacement of matrix multiplication a* v* by the continuous 
analogue / a(x,y)v(y)dy. But what would be the analogue of a vector valued 
functions of a vector variable? Mathematicians would just consider these to be 
functions or maps again but it is also traditional, especially in physics literature, 
to called such things functionals. An example might be an “action functional”, 
say S, defined on a set of curves in R 3 with a fixed interval fioTi] as domain: 

S'fc] = f L(c(t),c(t),t)dt 
Jto 

Here, L is defined on R 3 x R 3 x [to, ^l] but S takes a curve as an argument and 
this is not just composition of functions. Thus we will not write the all too 
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common expression “S[c(£)]”. Also, ,S'[c] denotes the value of the functional at 
the curve c and not the functional itself. Physicists might be annoyed with this 
but it really does help to avoid conceptual errors when learning the subject of 
calculus on function spaces (or general Banach spaces). 

When one defines a directional derivative in the Euclidean space R" := 
{: x=(x 1 , ... ,x n ) : i' £ R} it is through the use of a difference quotient: 


D h f{x) := lim 
£ — ^0 


f(x + eh) - f(x) 
£ 


which is the same thing as Dhf{x) := f{x + eh). Limits like this one 

make sense in any topological vector space 2 . For example, if C([0, 1]) denotes 
the space of continuous functions on the interval [0, 1] then one may speak of 
functions whose arguments are elements of C([0, 1]). Here is a simple example 
of such a “functional” : 

F[f] ■= [ f 2 (x)dx 

J[ o,i] 

The use of square brackets to contain the argument is a physics tradition that 
serves to warn the reader that the argument is from a space of functions. Notice 
that this example is not a linear functional. Now given any such functional, say 
F, we may define the directional derivative of F at / £ C([0, 1]) in the direction 
of the function h. G C([0, 1]) to be 


D h F(f) := lim 
£ — ^0 


F(f + eh)-F(f) 

£ 


whenever the limit exists. 


Example 1.1 Let F[f] := Jj Q ^ f 2 (x)dx as above and let f(x) = x 3 and h(x) = 
sin(a: 4 ). Then 


D h F(f) = lim -F(f + eh) — F(f) 
£— >0 £ 


_d 

de 

d_ 

de 


F(f + eh) 


£=0 


c =0«/[0,l] 


( f(x ) + eh{x)) 2 dx 


2 f f(x)h.(x)dx = 2 f x 3 sin(7rx 4 )da; 

^[ 0 , 1 ] Jo 


1 

7T 


Note well that h and / are functions but here they are, more importantly, 
“points” in a function space! What we are differentiating is F. Again, F[f] is 
not a composition of functions; / is the dependent variable here. 

2 “Topological vector space” will be defined shortly. 
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Exercise 1.1 See if you can make sense out of the expressions and analogies 
in the following chart: 


X 

> 

/ 

f(x) 

> 

m 

df 


SF 

9L 

dx % 

> 

SF 
< 5 /(*) 


> 

I 6f( x ) V ( X ) dx 

/•••/ f(x)dx 1 ...dx n 


niF[fm x df( X )) 


Exercise 1.2 Some of these may seem mysterious- especially the last one which 
still lacks a general rigorous definition that covers all the cases needed in quan- 
tum theory. Don’t worry if you are not familiar with this one. The third one 
in the list is only mysterious because we use 5. Once we are comfortable with 
calculus in the Banach space setting we will see that 6F just mean the same 
thing as dF whenever F is defined on a function space. In this context dF is a 
linear functional on a Banach space. 


So it seems that we can do calculus on infinite dimensional spaces. There are 
several subtle points that arise. For instance, there must be a topology on the 
space with respect to which addition and scalar multiplication are continuous. 
This is the meaning of topological vector space. Also, in order for the derivative 
to be unique the topology must be Hausdorff. But there are more things to 
worry about. 

We are also interested in having a version of the inverse mapping theorem. It 
turns out that most familiar facts from calculus on R" go through if we replace 
R" by a complete normed space (see 26.21). There are at least two issues that 
remain even if we restrict ourselves to Banach spaces. First, the existence of 
smooth bump functions and smooth partitions of unity (to be defined below) are 
not guaranteed. The existence of smooth bump functions and smooth partitions 
of unity for infinite dimensional manifolds is a case by case issue while in the 
finite dimensional case their existence is guaranteed . Second, there is the 
fact that a subspace of a Banach space is not a Banach space unless it is a 
closed subspace. This fact forces us to introduce the notion of a split subspace 
and the statements of the Banach spaces versions of several familiar theorems, 
including the implicit function theorem, become complicated by extra conditions 
concerning the need to use split (complemented) subspaces. 
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1.2 Chain Rule, Product rule and Taylor’s The- 
orem 

Theorem 1.1 (Chain Rule) LetUi andU 2 be open subsets of Euclidean spaces 
Ei and E 2 respectively. Suppose we have continuous maps composing as 

Eg 

where E 3 is a third Euclidean space. If f is differentiable at p and g is dif- 
ferentiable at f(p) then the composition is differentiable at p and D(g o /) = 
Dg{f{ p)) o Dg( p). In other words, if v € Ei then 

D (g° f)\ p -v= Dg\ f{p) ■ (Df\ p ■ v). 

Furthermore, if f € C r (U \ ) and g € C r (U 2 ) then g o f e C r (U\). 

We will often use the following lemma without explicit mention when calcu- 
lating: 

Lemma 1.1 Let f : U CV-»W be twice differentiable at x 0 € U C V then 
the map D v f : x i— > Df(x) ■ v is differentiable at Xo and its derivative at Xo is 
given by 

D(D v f)\ xg ■ h = -D 2 /(x 0 )(h, v). 

Theorem 1.2 1/ / : (7 C V - » W is twice differentiable on U such that D 2 f is 
continuous, i.e. if f € C 2 (U) then D 2 f is symmetric: 

D 2 f(p)(w,v) = D 2 /(p)(v,w). 

More generally, if D k f exists and is continuous then D k f (p) € (V; W). 

Theorem 1.3 Let g € L{ Fi, F 2 ; W) be a bilinear map and let f\ : U C E — > Fi 
and f 2 : U C E — > F 2 be differentiable (resp. C r ,r > 1) maps. Then the 
composition g(fi, fi) is differentiable (resp. C r ,r > 1) on U where g{f\,f 2 ) ■ 
x ^ £>(/i ( x ) , fi (x)). Furthermore, 

Dg\ x (/i,/ 2 ) • v =g{Df 1 | x • v, / 2 (x)) + £>(/i(x), Df 2 | x • v). 

In particular, if F is an algebra with product * and f\ : U C E — > F and f 2 ■ U C 
E — * F then fi * / 2 is defined as a function and 

D(f± * / 2 ) • v = ( Dfi ■ v) * (/ 2 ) + (Dff ■ v) ★ (. Dfi ■ v). 


1.3 Local theory of maps 

Inverse Mapping Theorem 

Definition 1.7 Let E and F be Euclidean vector spaces. A map will be called a 
C r diffeomorphism near p if there is some open set U C dom(/) containing 
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p such that f\jj.'U — > f(U) is a C r diffeomorphism onto an open set f(U). The 
set of all maps which are diffeomorphisms near p will be denoted Diffp(E, F). If 
f is a C r diffeomorphism near p for all p £ U = dom(/) then we say that f is 

a local C r diffeomorphism. 

Theorem 1.4 (Implicit Function Theorem I) Let E 1; E 2 and F Euclidean 
vector spaces and let U x V C Ei x E 2 be open. Let f : U x V — » F be a 
C r mapping such that f(x o,yo) — 0. If H 2 /( X0j y 0 ) : E 2 — > F is a continuous linear 
isomorphism then there exists a (possibly smaller) open set Uo C U with Xo € Uq 
and unique a mapping g : Uq — > V with g(x o) = yo such that 

/(x,5(x)) = 0 


for all x € Uq. 


Proof. Follows from the following theorem. ■ 

Theorem 1.5 (Implicit Function Theorem II) Let Ei, E 2 and F be as above 
and U x V C Ei x E 2 open. Let f : U x V — > F be a C r mapping such that 
/(x 0 ,y 0 ) = wo- IfD 2 f(x 0 ,y 0 ) : E 2 — > F is a continuous linear isomorphism then 
there exists (possibly smaller) open sets Uq C U and Wq C F with x 0 € Uo and 
w 0 £ Wq together with a unique mapping g : Uq x Wo —> > V such that 

/(x,ff(x,w)) = w 

for all x £ Uq. Here unique means that any other such function h defined on a 
neighborhood U( x W( will equal g on some neighborhood o/(xo,wo). 

Proof. Sketch: Let 4' : U x V — * Ei x F be defined by 4'(x, y) = (x, /(x, y)). 
Then D4'(x 0 ,yo) has the operator matrix 

id_Ej 0 

T>i/(x 0 ,yo) D 2 f{x 0 ,y 0 ) _ 

which shows that D4'(xo,yo) is an isomorphism. Thus 4' has a unique local 
inverse 4/ -1 which we may take to be defined on a product set Uo x Wq. Now 
4'~ 1 must have the form (x, y) i— > (x, g(x, y)) which means that (x, /(x, g(x, w))) = 
4>(x, g(x, w)) = (x, w). Thus /(x, g(x,w)) = w. The fact that g is unique follows 
from the local uniqueness of the inverse 4> _1 and is left as an exercise. ■ 

Let U be an open subset of V and let I C ffi. be an open interval containing 
0. A (local) time dependent vector field on U is a C r -map F : / x U — > V (where 
r > 0) . An integral curve of F with initial value Xq is a map c defined on an 
open subinterval J C / also containing 0 such that 

c'(f) = F(t, c(t)) 
c( 0) = x 0 

A local flow for F is a map a : I 0 x U 0 — > V such that U 0 C U and such that 
the curve a x (t) = aft, x) is an integral curve of F with a x (0) = x 

If / : U — > V is a map between open subsets of V and W we have the notion 
of rank at p £ U which is just the rank of the linear map D p f : V — > W. 
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Definition 1.8 Let X, Y be topological spaces. When we write f :: X — > Y we 
imply only that f is defined on some open set in X. If we wish to indicate that 
f is defined near p £ X and that /( p) = q we will used the pointed category 
notation together with the symbol 

/:: (X, p)-(Y.q) 

We will refer to such maps as local maps at p. Local maps may be com- 
posed with the understanding that the domain of the composite map may become 
smaller: Iff :: (X, p) -> (Y, q) and g :: (Y , q) -4 (G, z) then gof :: (X. p) -4 (G, z) 
and the domain of g o f will be a non-empty open set. 

Theorem 1.6 (The Rank Theorem) Let f : (V,p) —4 (W, q) be a local map 
such that Df has constant rank r in an open set containing p. Suppose that 
dim(V) = n and dim(W) = to Then there are local diffeomorphisms g\ :: 
( \/,p ) — > (R",0) and g 2 :: (W, g) — > (R m ,0) such that g 2 o / ° gf 1 is a local 
diffeomorphism near 0 with the form 

(x 1 , ....x n ) (a: 1 , ....x r , 0, 0). 

Proof. Without loss of generality we may assume that / : (R", 0) — > (R m , 0) 
and that (reindexing) the r x r matrix 

(dp\ 

\ dxj Jl<i,j<r 

is nonsingular in an open ball centered at the origin of R”. Now form a map 
gi(x 1 , ....x n ) = (f 1 (x ), ..., f r (x), x r+l , ...,x n ). The Jacobian matrix of g\ has 
the block matrix form 

(&i) 

0 I n —r 

which clearly has nonzero determinant at 0 and so by the inverse mapping 
theorem g\ must be a local diffeomorphism near 0. Restrict the domain of g\ 
to this possibly smaller open set. It is not hard to see that the map / o gf 1 
is of the form (z 1 , ..., z n ) (z 1 ,...,z r , 7 r+1 (z), ..., y m (z)) and so has Jacobian 
matrix of the form 

I r 0 
* (£) ' 

Now the rank of ( ) must be zero near 0 since the rank(/) = 

rank(/ oh" 1 ) = r near 0. On the said (possibly smaller) neighborhood we now 
define the map g 2 : (R m , q) — > (R m , 0) by 

(: y\ y m ) - (y 1 , y\ y r+1 - Y +1 (y*, o), ..., y m - 7 m (y*, o)) 

where (j/*, 0) = (y, ..., y r , 0, 0). The Jacobian matrix of g 2 has the form 


I r 0 

* I 
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and so is invertible and the composition 32 ° / ° 3i 1 has the form 
z f ^i ( 2 », 7 r+ i(z),..., 7 m (z)) 

S (z*,"/ r +i(z) -7 P ^i(z*,0),...,7 m (z) -7 m (2*,0)) 

where (2*, 0 ) = (2 1 , 2 r , 0 , 0 ). It is not difficult to check that 32 0 / 0 3f : 
has the required form near 0. ■ 

Starting with a fixed V, say the usual example F", there are several stan- 
dard methods of associating related vector space using multilinear algebra. The 
simplest example is the dual space (F")*. Now beside F" there is also F„ which 
is also a space of n— tuples but this time thought of as row vectors. We shall 
often identify the dual space (F n )* with F„ so that for v € F" and £ € (F n )* 
the duality is just matrix multiplication £(v) = £v. The group of nonsingular 
matrices, the general linear group Gl(n, F) acts on each of these a natural way: 

1 . The primary action on F" is a left action and corresponds to the standard 
representation and is simply multiplication from the left: (3, v) —>gv. 

2 . The primary action on (F n )* = F„ is also a left action and is (3, v) — > vg ~ 1 
(again matrix multiplication). In a setting where one insists on using 
only column vectors (even for the dual space) then this action appears as 
(3, v*) — > (3 _1 ) t v*. The reader may recognize this as giving the contragra- 
dient representation. 

Differential geometry strives for invariance and so we should try to get away 
from the special spaces V such as F” and F„ which often have a standard 
preferred basis. So let V be an abstract F— vector space and V* its dual. For 
every choice of basis e = (ei, e„) for V there is the natural map u e : F" — » V 
given by e : v 1— > e(v) = Identifying e with the row of basis vectors 

(e 1 , ..., e") we see that u e (v) is just formal matrix multiplication 

u e : v 1— > ev 

( 

— (ei, ..., e n ) I • 

V V n 

Corresponding to e = (ei, ..., e n ) there is the dual basis for V* which we write as 
e* = (e 1 , , e n ). In this case too we have a natural map u e * : F n * := F„ — > V* 
given by 

u e * : v* e- > v*e* 

= {v\...,v n ) 

In each, the definition of basis tells us that e : F” — > V and e* : F„ — > V* are 
both linear isomorphisms. Thus for a fixed frame e each v £ V may be written 
uniquely v = ev while each v* £ V* we have the expansion v* = ve 




Chapter 2 

Differentiable Manifolds 


An undefined problem has an infinite number of solutions. 
-Robert A. Humphrey 


2.1 Rough Ideas I 

The space of n-tuples R™ is often called Euclidean space by mathematicians but 
it might be a bit more appropriate the refer to this a Cartesian space which is 
what physics people often call it. The point is that Euclidean space (denoted 
here as E n ) has both more structure and less structure than Cartesian space. 
More since it has a notion of distance and angle, less because Euclidean space 
as it is conceived of in pure form has no origin or special choice of coordinates. 
Of course we almost always give R" it usual structure as an inner product space 
from which we get the angle and distance and we are on our way to having a 
set theoretic model of Euclidean space. 

Let us imagine we have a pure Euclidean space. The reader should think 
physical of space as it is normally given to intuition. Rene Descartes showed 
that if this intuition is axiomatized in a certain way then the resulting abstract 
space may be put into one to one correspondence with the set of ?r-tuples, the 
Cartesian space R”. There is more than one way to do this but if we want the 
angle and distance to match that given by the inner product structure on R" 
then we get the familiar rectilinear coordinates. 

After imposing rectilinear coordinates on a Euclidean space E n (such as the 
plane E 2 ) we identify Euclidean space with R n , the vector space of n — tuples of 
numbers. In fact, since a Euclidean space in this sense is an object of intuition 
(at least in 2d and 3d) some may insist that to be sure such a space of points 
really exists that we should in fact start with R” and “forget” the origin and 
all the vector space structure while retaining the notion of point and distance. 
The coordinatization of Euclidean space is then just a “remembering” of this 
forgotten structure. Thus our coordinates arise from a map x : E n — > R" which 
is just the identity map. 


15 
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The student must learn how differential geometry is actually done. These 
remarks are meant to encourage the student to stop and seek the simplest most 
intuitive viewpoint whenever feeling overwhelmed by notation. The student is 
encouraged to experiment with abbreviated personal notation when checking 
calculations and to draw diagrams and schematics that encode the geometric 
ideas whenever possible. The maxim should be “Let the picture write the equa- 
tions” . 

Now this approach works fine as long as intuition doesn’t mislead us. But 
on occasion intuition does mislead us and this is where the pedantic notation 
and the various abstractions can save us from error. 


2.2 Topological Manifolds 

A topological manifold is a paracompact 1 Hausdorff topological space M such 
that every point p £ M is contained in some open set U p which is the domain 
of a homeomorphism (f> : U p — > V onto an open subset of some Euclidean space 
R". Thus we say that M is “locally Euclidean”. Many authors assume that a 
manifolds is second countable and then show that paracompactness follows. It 
seems to the author that paracompactness is the really important thing. In fact, 
it is surprising how far one can go without assume second countability. This 
has the advantage of making foliations (defined later) manifolds. Nevertheless 
we will assume also second countability unless otherwise stated. 

It might seem that the n in the definition might change from point to point 
or might not even be a well defined function on M depending essentially on the 
homeomorphism chosen. However, this in fact not true. It is a consequence of a 
fairly difficult result of Brower called “invariance of domain” that the “dimen- 
sion” n must be a locally constant function and therefore constant on connected 
manifolds. This result is rather trivial if the manifold has a differentiable struc- 
ture (defined below). We shall simply record Brower’s theorem: 

Theorem 2.1 (Invariance of Domain) The image of an open set U C R" 

by a 1-1 continuous map f : U — > R™ is open. It follows that if U C R" is 
homeomorphic to V C R 171 then m = n. 

Each connected component of a manifold could have a different dimension 
but we will restrict our attention to so called “pure manifolds” for which each 
component has the same dimension which we may then just refer to as the 
dimension of M. The latter is denoted dim(M). A topological manifold 
with boundary is a second countable Hausdorff topological space M such 
that point p € M is contained in some open set U p which is the domain of a 
homeomorphism if : U — > V onto an open subset V of some Euclidean half 
space R" =: {x : x l < 0} 2 . A point that is mapped to the hypersurface 
R” =: {x : x 1 = 0} under one of these homeomorphism is called a boundary 

1 Paracompact means every open cover has a locally finite refinement. 

2 Using - ^ =: {x : x 1 > 0} is equivalent at this point in the development and is actually 
the more popular choice. Later on when we define orientation on a (smooth) manifold this 
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point. As a corollary to Brower’s invariance of domain theorem this concept is 
independent of the homeomorphism used. The set of all boundary points of M is 
called the boundary of M and denoted dM. The interior is int(M) := M — dM. 

Topological manifolds are automatically normal and paracompact. This 
means that each topological manifold supports C°— partitions of unity: Given 
any cover of M by open sets {U a } there is a family of continuous functions {0,} 
whose domains form a cover of M such that 

(i) supp(/?j) C U a for some a, 

(ii) each p G M is contained in a neighborhood which intersect the support of 
only a finite number of the /3,. 

(iii) we have Pi = 1 (notice that the sum Y^Pi( x ) is finite for each p £ M 
by (ii))- 

Remark 2.1 For differentiable manifolds we will be much more interested in 
the existence of smooth partitions of unity. 

2.3 Differentiable Manifolds and Differentiable 
Maps 

The art of doing mathematics consists in finding that special case 
which contains all the germs of generality. Hilbert, David (1862-1943) 

Definition 2.1 Let M be a topological manifold. A pair (17, x) where U is an 
open subset of M and x : U — > R n is a homeomorphism is called a chart or 
coordinate system on M . 

If (17, x) chart (with range in R n ) then x = (x 1 , ...., x n ) for some functions 
x l (i = l,...,n) defined on U called coordinate functions. To be precise, we 
are saying that if pi : R n — > K. is the obvious projection onto the <-th factor 
of R n := Ix-'Xl then x l := p, o x and so for p £ M we have x(p) = 
{x 1 (p) 1 ....,x n (p)) <= R n . 

By the very definition of topological manifold we know that we may find a 
family of charts {(x Q , U a )} ae A whose domains cover M; that is M = U ae AU a - 
Such a cover by charts is called an atlas for M. It is through the notion of 
change of coordinate maps (also called transition maps or overlap maps etc.) 
that we define the notion of a differentiable structure on a manifold. 

Definition 2.2 Let A = {(x a ,U a )} a ^A be an atlas on a topological manifold 
M . Whenever the overlap U a fl Up between two chart domains is nonempty we 
have the change of coordinates map x^oxj 1 : x a (U a fl Up) — * xp(U a fl Up). 
If all such change of coordinates maps are C r -diffeomorphisms then we call the 
atlas a C r -atlas. 

“negative” half space will be more convenient since we will be faced with less fussing over 
minus signs. 
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Now we might has any number of C r -atlases on a topological manifold but 
we must have some notion of compatibility. A chart (U, x) is compatible with 
some atlas A = {(x a ,U a )} ae A on M if the maps x o x” 1 : x a (U a fl U) — > 
x(U a n U ) are C r -diffeomorphisms defined. More generally, two C r -atlases 
A = {(x a , U a )} a £A and A' = {x a /, U a >} a > e A' are said to be compatible if the 
union AoA' is a C r - atlas. It should be pretty clear that given a C r -atlases 
on M there is a unique maximal atlas that contains A and is compatible with 
it. Now we are about to say a CA-atlas on a topological manifold elevates it 
too the status C r -differentiable manifold by giving the manifold a so-called C r - 
structure (smooth structure) but there is a slight problem or two. First, if two 
different atlases are compatible then we don’t really want to consider them to 
be giving different C r -structures. To avoid this problem we will just use our 
observation about maximal atlases. The definition is as follows: 

Definition 2.3 A maximal C r -atlas for a manifold M is called a C r -differentiable 
structure. The manifold M together with this structure is called a C r -differentiable 
manifold. 

Now note well that any atlas determine a unique CA-differentiable structure 
on M since it determine the unique maximal atlas that contains it. So in 
practice we just have to cover a space with mutually C' r -compatible charts in 
order to turn it into (or show that it has the structure of) a C r -differentiable 
manifold. In practice, we just need some atlas or maybe some small family of 
atlases that will become familiar to the reader for the most commonly studied 
smooth manifolds. For example, the space R n is itself a C°° manifold (and 
hence a CA-manifold for any r > 0) as we can take for an atlas for R n the 
single chart (id, R n ) where id : R™ — » R™ is just the identity map id( x) = x. 
Other atlases may be used in a given case and with experience it becomes 
more or less obvious which of the common atlases are mutually compatible and 
so technical idea of a maximal atlas usually fades into the background. For 
example, once we have the atlas {(id, R 2 )} on the plane (consisting of the single 
chart) we have determined a differentiable structure on the plane. But then 
the chart given by polar coordinates is compatible with later atlas and so we 
could though this chart into the atlas and “fatten it up” a bit. In fact, there are 
many more charts that could be thrown into the mix if we needed then because 
in this case any local diffeomorphism U C R 2 — > R 2 would be compatible with 
the “identity” chart (id, R 2 ) and so would also be a chart within the same 
differentiable structure on M 2 . By the way, it is certainly possible for there to 
be two different differentiable structures on the same topological manifold. For 
example the chart given by the cubing function (x i— > a: 3 ,!! 1 ) is not compatible 
with the identity chart (id, R 1 ) but since the cubing function also has domain 
all of R 1 it too provides an atlas. But then this atlas cannot be compatible 
with the usual atlas {(id, R 1 )} and so they determine different maximal atlases. 
Now we have two different differentiable structures on the line R 1 . Actually, 
the two atlases are equivalent in a sense that we will make precise below (they 
are diffeomorphic) . We say that the two differentiable structures are different 
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but equivalent or diffeomorphic. On the other hand, it is a deep result proved 
fairly recently that there exist infinitely many non-diffeomorphic differentiable 
structures on R 4 . The reader ought to be wondering what is so special about 
dimension four. 


Example 2.1 Each Euclidean space R” is a differentiable manifold in a trivial 
way. Namely, there is a single chart that forms an atlas 3 which is just the 
identity map R n — > R". Notice however that the map e : (a: 1 , x 2 , ..., x n ) i— > 
((ad) 1 / 3 , x 2 , ... ,x n ) is also a chart. Thus we seem to have two manifolds R™,.4i 
and R",^42- This is true but they are equivalent in another sense. Namely, 
they are diffeomorphic via the map e. See definition 2.13 below. Actually, ifV 
is any vector space with a basis (/i, /„) and dual basis (/*, /*) then once 
again, we have an atlas consisting of just one chart defined on all of V which 
is the map x : v i— > (/*v, ...,/*v) G R". On the other hand V may as well be 
modelled (in a sense to be defined below) on itself using the identity map as the 
sole member of an atlas! The choice is a matter of convenience and taste. 


Example 2.2 The sphere S 2 C R 3 . Choose two points as north and south poles. 
Then off of these two pole points and off of a single half great circle connecting 
the poles we have the usual spherical coordinates. We actually have many such 
systems of spherical coordinates since we can re-choose the poles in many dif- 
ferent ways. We can also use projection onto the coordinate planes as charts. 
For instance let Uf be all ( x , y , z) £ S 2 such that z > 0. Then ( x , y, z) i— > (x, y) 
provides a chart Uf — > R 2 . The various transition functions can be computed 
explicitly and are clearly smooth. We can also use stereographic projection 
to give charts. More generally, we have the n-sphere S n C R” +1 with two charts 
U + ,ip + and where 

U'* \p= (aq, x n+ i) G S n : x n+1 ±1} 


and f> + (resp. if-) is stereographic projection from the north pole (0,0. ...0,1) 
(resp. south pole (0,0, ... ,0,-1)). Explicitly we have 


V»+(p) 

tf-(p) 


(l-x n+1 ) {Xu 
(1 + a;„+i) ^ Xl ’ 


■■■; X n ) G R" 

o,) e R” 


(2.1) 


Exercise 2.1 Compute otpff and o ip + . 

Example 2.3 The set of all lines through the origin in R 3 is denoted P 2 (R) 
and is called the real projective plane . Let U z be the set of all lines £ G P 2 (R) 

3 Of course there are many other compatible charts so this doesn’t form a maximal atlas 
by a long shot. 
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not contained in the x, y plane. Every line p £ U z intersects the plane z = 1 
at exactly one point of the form (x(£),y(£), 1). We can define a bijection ip z ■ 
U z — » R 2 by letting p i— > (a :(£),y(£)). This is a chart for P 2 (R) and there 
are obviously two other analogous charts ip x , U x and ipy, U y which cover P 2 (R). 
More generally, the set of all lines through the origin in R " +1 is called projective 
n-space denoted P n (R) and can be given an atlas consisting of charts of the form 
’ifi, Ui where 


Ui = {l £ P„(R) : £ is not contained in the hyperplane x 1 = 0 
ipi(£) = the unique coordinates (1 r,...,u n ) such that (u 1 , ..., 1, u n ) is 
on the line £. 


Example 2.4 The graph of a smooth function f : R™ — > R is the subset of the 
Cartesian product R™ x R given by T f = {(a:, f(x)) : x € R"}. The projection 
map r f — > R" is a homeomorphism and provides a global chart on T f making 
it a smooth manifold. More generally, let S C R” +1 be a subset which has the 
property that for all x £ S there is an open neighborhood U C R" +1 and some 
function f :: R” — » R such that U n S consists exactly of the points of in U of 
the form 

(x 1 ,.„x 3 ~ 1 ,f(x 1 ,,.., xi, ..,x n+1 ),x J+1 ,...,x n ). 

Then on U D S the projection 


(x 1 , -,x J 1 ,f(x 1 ,...,xh..,x n+1 ),x J+1 ,...,x n ) 1 ^ (a; 1 , 


~J - 1 M+i 


,x n ) 


is a chart for S. In this way, S is a differentiable manifold. Notice that S is 
a subset of the manifold R " +1 and the manifold topology indu??ced by the atlas 
just described is the same as the relative topology of S in R n+1 . The notion of 
regidar submanifold generalizes this idea to arbitrary smooth manifolds. 


Example 2.5 The set of all m x n matrices M mxn (also written R”^ is an 
urn-manifold modelled on R m ™. We only need one chart, again since it clear 
that M mx „ is in natural 1-1 correspondence with R mn by the map [fl^] 1 — > 
(an, a± 2 , ...., a m „). Also, the set of all non-singular matrices GL(n,R) is an 
open submanifold o/M„ xn = R" . 


If we have two manifolds Mi and M 2 we can form the topological Cartesian 
product M\ x M2. We may give Mi x M 2 a differentiable structure that induces 
this same product topology in the following way: Let Am-, and Am 2 be atlases 
for Mi and M2. Take as charts on Mi x M2 the maps of the form 

x Q x y 7 : U a x Ey — > R " 1 x R " 2 

where (x a ,U a ) is a chart form Am-, and y~ 1 ,V 1 a chart from Am 2 - This gives 
Mi x M2 an atlas called the product atlas which induces a maximal atlas and 
hence a differentiable structure. 
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Example 2.6 The circle is clearly a manifold and hence so is the product T = 
S 1 x S' 1 which is a torus. 

Example 2.7 For any manifold M we can construct the “ cylinder ” M x / 
where I is some open interval in R. 


2.4 Pseudo-Groups and Models Spaces 

Without much work we can reformulate our definition of a manifold to include 
as special cases some very common generalizations which include complex mani- 
folds and manifolds with boundary. If fact, we would also like to include infinite 
dimensional manifolds where the manifolds are modelled on infinite dimensional 
Banach spaces rather than R". It is quite important for our purposes to realize 
that the spaces (so far just R") which will be the model spaces on which we 
locally model our manifolds should have a distinguished family of local home- 
omorphisms. For example, C r — differentiable manifolds are modelled on R" 
where on the latter space we single out the local C r — diffeomorphisms between 
open sets. But we also study complex manifolds, foliated manifolds, manifolds 
with boundary, Hilbert manifolds and so on. Thus we need appropriate model 
space but also, significantly, we need a distinguished family on maps on the 
model space. Next we are going to define the notion of a transformation pseu- 
dogroup which will be a family of maps with certain properties. The definition 
will seem horribly complex without first having something concrete in mind so 
we first single out a couple of examples that fit the abstract pattern we are after. 
The first one is just the set of all diffeomorphisms between open subsets of R" 
(or any manifold). The second one, based on an example in the article [We4], 
is a bit more fanciful-a sort of “toy pseudogroup”. Consider the object labelled 
“The model M” in figure 2.1. Consider this set as made of tiles and their edges 
(grout between the tiles plus the outer boundary). Let T be the set of all maps 
from open sets of the plane to open sets of the plane that are restrictions of 
rigid motions of the plane. The we take as our example L toy := T| M which is 
the homeomorphisms of (relatively) open sets of M to open sets of M which are 
restrictions of the maps in T (to the intersections of their domains with M). 

Definition 2.4 A pseudogroup of transformations, sayT, of a topological 
space X is a family {4> 7 } 76 / of homeomorphisms with domain U 1 and range V 1 
both open subsets of X, which satisfies the following properties: 

1) idx e T. 

2) 4> 7 £ r implies <I>“ 1 € T. 

3) For any open set U C X, the restrictions < f > 7 | tf are in T for all 4> 7 € T. 

4) The composition of any two elements 4> 7 , 4V £ T are elements of T 
whenever the composition is defined: 

$ 7 o : $„(t/ 7 n U„) -► $ 7 (t/ 7 n U v ) 
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The model M 

Figure 2.1: Tile and grout spaces 


5) For any subfamily { < f > 7 } 7 £G 1 C T such that $ 7 | [/ nu = v nu when- 
ever U 1 fl t/„ 7 ^ 0 then the mapping defined by $ : U 7 eGi ^7 ~ * U 7 <=Gi ^7 * s an 
element of T if it is a homeomorphism. 

Exercise 2.2 Check that each of these axioms is satisfied by our “ fanciful ex- 
ample” r ioy • 

Definition 2.5 A sub-pseudogroup E of a pseudogroup is a subset ofT that 
is also a pseudogroup (and so closed under composition and inverses). 

We will be mainly interested in C r -pseudogroups and the spaces which sup- 
port them. Our main example will be the set r^„ of all C r diffeomorphisms 
between open subsets of More generally, for a Banach space B we have the 
C' r -pseudo-group Fg consisting of all C r diffeomorphisms between open subsets 
of a Banach space B. Since this is our prototype the reader should spend some 
time thinking about this example. 

Definition 2.6 A C r — pseudogroup T of transformations of a subset M of Ba- 
nach space B is a pseudogroup arising as the restriction to M of some sub- 
pseudogroup ofT r M . The pair (M,r) is called a model space . As is usual in 
cases like this we sometimes just refer to M as the model space. 

Example 2.8 Recall that a map U C C n — > C n is holomorphic if the derivative 
(from the point of view of the underlying real space is in fact complex linear. 
A map holomorphic map with holomorphic inverse is called biholomorphic. The 
set of all biholomorphic maps between open subsets of C" is a pseudogroup. 
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This is a C r -pseudogroup for all r including r = u>. In this case the subset M 
we restrict to is just C n itself. 

Let us begin again redefine a few notions in greater generality. Let M be 
a topological space. An M-chart on M is a homeomorphism x whose domain 
is some subset U C M and such that x(U) is an open subset (in the relative 
topology) of a fixed model space McB. 

Definition 2.7 Let T be a C r -pseudogroup of transformations on a model space 
M. A r -atlas, for a topological space M is a family of charts Ar = {(x Q , U a )} a ^A 
(where A is just an indexing set) which cover M in the sense that M = U a eA 
and such that whenever U a nUp is not empty then the map 

X/3 o x” 1 : x a (U a l~l Up) -> xp(U a l~l Up) 

is a member of T. The maps xp ox” 1 are called various things by various 
authors including “transition maps”, “coordinate change maps”, and “overlap 
maps”. 

Now the way we set up the definition the model space M is a subset of a 
Banach space. If M the whole Banach space (the most common situation) and 
if r = rj^ (the whole pseudogroup of local C^diffeomorphisms) then we have 
what was before called a C r atlas M. 

Exercise 2.3 Show that this definition of C r atlas is the same as our original 
definition in the case where M is the finite dimensional Banach space R n . 

In practice, a T— manifold is just a space M (soon to be a topological man- 
ifold) together with an T— atlas A but as before we should tidy things up a bit 
for our formal definitions. First, let us say that a bijection onto an open set in 
a model space, say x : U — > x(U) C M, is compatible with the atlas A if for 
every chart (x a , U a ) from the atlas A we have that the composite map 

x o x” 1 : x a (U a DU)-* xp(U a n U) 

is in r. The point is that we can then add this map in to form a larger equivalent 
atlas: A! — A U {x, U}. To make this precise let us say that two different T 
atlases, say A and B , are equivalent if every map from the first is compatible 
(in the above sense) with the second and visa- versa. In this case A! = A U B is 
also an atlas. The resulting equivalence class is called a T— structure on M. 

Now it is clear that every equivalence class of atlases contains a unique 
maximal atlas which is just the union of all the atlases in the equivalence 
class. Of course every member of the equivalence class determines the maximal 
atlas also-just toss in every possible compatible chart and we end up with the 
maximal atlas again. Since the atlas was born out of the pseudogroup we might 
denote it by Ar and say that it gives M a 
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Definition 2.8 A topological manifold M is called a C r — differentiable man- 
ifold (or just C r manifold) if it comes equipped with a differentiable structure. 
Whenever we speak of a differentiable manifold we will have a fixed differentiable 
structure and therefore a maximal C r — atlas Am in mind. A chart from Am 
will be called an admissible chart. 

We started out with a topological manifold but if we had just started with a 
set M and then defined a chart to be a bijection x : U — ■> x(C/), only assuming 
x(f7) to be open then a maximal atlas Am would generate a topology on M . 
Then the set U would be open. Of course we have to check that the result is 
a paracompact space but once that is thrown into our list of demand we have 
ended with the same notion of differentiable manifold. To see how this approach 
would go the reader should consult the excellent book [A,B,R]. 

In the great majority of examples the subset M C V is in fact equal to V 
itself. One important exception to this will lead us to a convenient formulation 
of manifold with boundary. First we need a definition: 

Definition 2.9 Let A G M*. In the case of R" it will be enough to consider 
projection onto the first coordinate x 1 . Now let Mj = {x£ M: A(x) > 0} and 
Mjj = {xe M; A(x) < 0} and <9Mj( = = {x G M: A(x) = 0} is the kernel 

of X. Clearly Mj( and M)C are homeomorphic and <9M^ is a closed subspace. 4 

Example 2.9 LetT r M _ be the restriction to of the set of C r -diffeomorphisms 

<fi from open subset of M to open subsets of M which have the following property 

*) If the domain U of <f> € T^ has nonempty intersection with Mo := {x G M; 
A(x) = 0} then <^| M 0 nt/ : Mq n U — -> Mo n U . 

The model spaces together with an associated C r -pseudogroup will be the 
basis of many of our geometric construction even if we do not explicitly mention 
it 

Notation 2.1 Most of the time we will denote the model space for a manifold 
M (resp. N etc.) by M (resp. N etc.) That is, we use the same letter but use the 
sans serif font (this requires the reader to be tuned into font differences). There 
will be exceptions. One exception will be the case where we want to explicitly 
indicate that the manifold is finite dimensional and thus modelled on K" for 
some n. Another exception will be when E is the total space of a vector bundle 
over M . In this case E will be modelled on a space of the form M x E. This 
will be explained in detail when study vector bundles. 

Now from the vantage point of this general notion of model space and the 
spaces modelled on them we get a slick definition of manifold with bound- 
ary. A topological manifold M is called a C r — differentiable manifold with 
boundary (or just C r manifold with boundary) if it comes equipped with a 

ri\_ — structure. 

M x 

4 The reason we will use both E+ and E~ in the following definition for a technical reason 
having to do with the consistency of our definition of induced orientation of the boundary. 
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Remark 2.2 It may be the case that there are two or more different differen- 
tiable structures on the same topological manifold. But see remark 2. 4 below. 

Notice that the model spaces used in the definition of the charts were as- 
sumed to be a fixed space from chart to chart. We might have allowed for 
different model spaces but for topological reasons the model spaces must have 
constant dimension ( < oo) over charts with connected domain in a given con- 
nected component of M. In this more general setting if all charts of the manifold 
have range in a fixed M (as we have assumed) then the manifold is said to be a 
pure manifold and is said to be modelled on M. If in this case M = R" for 
some (fixed) n < oo then n is the dimension of M and we say that M is an 
n-dimensional manifold or n-manifold for short. 

Convention Because of the way we have defined things all differentiable man- 
ifolds referred to in this book are assumed to be pure. We will denote the 
dimension of a (pure) manifold by dim(M). 

Definition 2.10 A chart (x, U) on M is said to be centered at p if x(p) = 

0 G M. 

If U is some open subset of a differentiable manifold M with atlas Am, then 
U is itself a differentiable manifold with an atlas of charts being given by all the 
restrictions (x Q | v nu ,U a C\U ) where (x a , U a ) G Am- We call refer to such an 
open subset U C M with this differentiable structure as an open submanifold 
of M. 

Example 2.10 Each Banach space M is a differentiable manifold in a trivial 
way. Namely, there is a single chart that forms an atlas 5 which is just the 
identity map M — > M. In particular R n with the usual coordinates is a smooth 
manifold. Notice however that the map e : (x 1 , x 2 , ..., x n ) i— > ((a; 1 ) 1 / 3 , x 2 , ..., x n ) 
is also a chart. It induces the usual topology again but the resulting maximal 
atlas is different! Thus we seem to have two manifolds R", A\ and R", A 2 - This 
is true but they are equivalent in another sense. Namely, they are diffeomorphic 
via the map e. See definition 2.13 below. Actually, ifV is any vector space with 
a basis (fi,...,f n ) and dual basis (/*,...,/*) then one again, we have an atlas 
consisting of just one chart, define on all ofV defined by x : v i— > (/(v, ..., /*v) G 
R n . On the other hand V may as well be modelled on itself using the identity 
map! The choice is a matter of convenience and taste. 

If we have two manifolds Mi and M 2 we can form the topological Cartesian 
product Mi x M 2 . We may give M 1 x M 2 a differentiable structure that induces 
this same product topology in the following way: Let Am x and Am 2 be atlases 
for Mi and M 2 . Take as charts on Mi x M 2 the maps of the form 

x a x y 7 : t/ a x + Mi x M 2 

5 Of course there are many other compatible charts so this doesn’t form a maximal atlases 
by a long shot. 
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where (x a ,U a ) is a chart form Am x and y~f,Vy a chart from Am 2 - This gives 
Mi x AI-2 an atlas called the product atlas which induces a maximal atlas and 
hence a differentiable structure. Thus we have formed the product manifold 
Mi x M2 where it is understood that the differentiable structure is as described 
above. 

It should be clear from the context that Mi and M 2 are modelled on Mi 
and M 2 respectively. Having to spell out what is obvious from context in this 
way would be tiring to both the reader and the author. Therefore, let us forgo 
such explanations to a greater degree as we proceed and depend rather on the 
common sense of the reader. 


2.5 Smooth Maps and Diffeomorphisms 

A function defined on a manifold or on some open subset is differentiable by 
definition if it appears differentiable in every coordinate system which intersects 
the domain of the function. The definition will be independent of which coor- 
dinate system we use because that is exactly what the mutual compatibility of 
the charts in an atlas guarantees. To be precise we have 

Definition 2.11 Let f : O C M — *1 fea function on M with open domain 
O. We say that f is C r -differentiable iff for every admissible chart U,x with 
U flO ^ 0 the function 

fox - 1 : x([/nO) 


is C r -differentiable. 

The reason that this definition works is because if U, x, U, x are any two 
charts with domains intersecting O then 

f ox- 1 = (f o x- 1 ) o(xo x- 1 ) 

we have whenever both sides are defined and since xox -1 is a C r — diffeomorphism, 
we see that /ox -1 is C r if and only if / o x” 1 is C r The chain rule is at work 
here of course. 

Remark 2.3 We have seen that when we compose various maps as above the 
domain of the result will in general be an open set which is the largest open set 
so that the composition makes sense. If we do not wish to write out explicitly 
what the domain is then will just refer to the natural domain of the composite 
map. 

Definition 2.12 Let M and N be C r manifolds with corresponding maximal 
atlases Am and An and modelled on R" and R. d respectively . A continuous map 
f : M — > N is said to be k times continuously differentiable or C' if for 
every choice of charts (x, U) from Am and (y, V) from An the composite map 


d 
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is C r on its natural domain (see convention 26. fl). The set of all C r maps 
M — > N is denoted C r (M,N) or sometimes C r {M — > N). 

Exercise 2.4 Explain why this is a well defined notion. Hint: Think about the 
chart overlap maps. 

Sometimes we may wish to speak of a map being C r at a point and for 
that we have a modified version of the last definition: Let M and N be C r 
manifolds with corresponding maximal atlases Am and An and modelled on 
R" and R d respectively. A (pointed) map / : (. M,p ) — > (N,q) is said to be k 
times continuously differentiable or C r at p if for every choice of charts 
(x, U) from Am and (y, V) from An containing p and q = f(p) respectively, the 
composite map 

yo/ox -1 :: (K", x(p)) -> ( R d , y (q)) 

is C r on some open set containing 

Just as for maps between open sets of Euclidean spaces we have 

Definition 2.13 A bijective map f : M —> N such that f and f~ l are C r 
with r > 1 is called a C r -diffeomorphism. In case r = oo we shorten C°°- 
diffeomorphism to just diffeomorphism. The group of all diffeomorphisms of 
a C°° onto itself is denoted Diff(M). 

Example 2.11 The map re : S 2 — > S 2 given by r$(x,y,z) = (xcos6—ysm0,xsm 
ycosO, z) for x 2 + y 2 + z 2 = 1 is a diffeomorphism (and also an isometry). 

Example 2.12 The map f : S 2 — > S 2 given by f(x,y,z) = (xcos((l — z 2 )0) — 
j/sin((l — z 2 )#),a;sin((l — z 2 )9) + y cos((l — z 2 )9),z) is also a diffeomorphism 
(but not an isometry). Try to picture this map. 

Definition 2.14 C r differentiable manifolds M and N will be called C r diffeo- 
morphic and then said to be in the same C r diffeomorphism class iff there is a 
C r diffeomorphism f : M — > N. 

Remark 2.4 It may be that the same underlying topological space M carries 
two different differentiable structures and so we really have two differentiable 
manifolds. Nevertheless it may still be the case that they are diffeomorphic. 
The more interesting question is whether a topological manifold can carry dif- 
ferentiable structures that are not diffeomorphic. It turns out that R 4 carries 
infinitely many pair-wise non- diffeomorphic structures (a very deep and difficult 
result) but R. k for k > 5 has only one diffeomorphism class. 

Definition 2.15 A map f : M — > N is called a local diffeomorphism iff every 
point p £ M is in an open subset U p C M such that f\ v : U p — > f(U) is a 
diffeomorphism. (note?to?self: Must we assume f(U) is open?) 

Example 2.13 The map tt : S 2 — > MP 2 given by taking the point ( x,y,z ) to 
the line through this point and the origin is a local diffeomorphism but is not a 
diffeomorphism since it is 2-1 rather than 1-1. 
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Example 2.14 If we integrate the first order system of differential equations 
with initial conditions 

y = x' 
y' = x 
z(o) = i 
y(o) = e 

we get solutions 

x (t\ £, 9) = (U + U) e* — e _t 

y (t; £, 0 ) = (\e + e* + (§0 - e- 4 

that depend on the initial conditions (£, 9) . Now for any t the map <f>t : (£, 9) i— > 
(x(t,(;,9),y(t,£,9)) is a diffeomorphism R 2 — * R 2 . This is a special case of a 
moderately hard theorem. 

Example 2.15 The map (x,y) ( i- z \ x , y ) i-z( a ^) 2/) where z(x,y) = sj 1 - x 2 - y 2 

is a diffeomorphism from the open disk 11(0,1) = {(x,y) : x 2 + y 2 <1} onto 
the whole plane. Thus 11(0, 1) and M 2 are diffeomorphic and in this sense the 
“ same ” differentiable manifold. 

We shall often need to consider maps which are defined on subsets S C M 
that are not necessarily open. We shall call such a map / smooth (resp. C r ) if 
at each point it locally as restriction of a smooth map f defined on an open set 
O containing S. In particular a curve defined on a closed interval [a, b } is called 
smooth if it has a smooth extension to an open interval containing [a,b\. We 
will occasionally need the following simple concept: 

Definition 2.16 A continuous curve c: [a. b] M into a smooth manifold is 
called piecewise smooth is there exists a partition a = to < 1 i < • • • < tk = b 
such that c restricted to each [U,ti+ 1 ] is smooth 
for 0 < i < k — 1. 

Before going to the next section let us compare local expressions in index 
notation with the index free notation (the latter being amenable to infinite 
dimensions). Consider an arbitrary pair of charts yancl x^ 1 and the transition 
maps y o x" 1 : x(U fl V) — > y (U fl V). We write 

yip) = y°x _ 1 (x(p)) 

for p £ U C\V. For finite dimensional manifolds we see this written as 

y\p) = y l {x l {p) 1 ...,x n {p)) (2.2) 

which make sense but we also see 

y l = y l i x 1 ,...,x n ). (2.3) 

In this last expression one might wonder if the x l are functions or numbers. 

But this ambiguity is sort of purposeful for if 2.5 is true for all p £ U fl V then 
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2.5 is true for all (x 1 , ...,x n ) € x(U fl V) and so we are unlikely to be led into 
error. This common and purposely notational ambiguity is harder to pull of in 
the case of index free notation. We will instead, write two different expressions 
in which the lettering and fonts are intended to be at least reminiscent of the 
classical index notation: 


y(p) = y ox 1 (*(p)) 

and 

y = yox“ 1 (x). 


In the first case, x and y are functions on U fl V while in the second x and y 
are elements of x{U fl V) and y(U fl V) respectively 6 . In order not to interfere 
with our subsequent development let us anticipate the fact that this notational 
principle will be manifest later when we compare and make sense out of the 
following familiar looking expressions: 


dy(0 = 


dy 

dx 


' dx(£) 


and 


dy 

dx 


x(p) 


x(p) 


which should be compared with the classical expressions 


dy \ 0 


dy i 

dx k 


and 


dx k { 0 


dy_ v ‘ 

dx kl 


2.6 Coverings and Discrete groups 

2.6.1 Covering spaces and the fundamental group 

In this section and later when we study fiber bundles many of the results are 
interesting and true in either the purely topological category or in the differen- 
tiable category. In order to not have to do things twice let us agree to mean by 
C r — manifold if r > 1 and if r = 0 simply a paracompact Hausdorff topological 
space in case r = 0. All relevant maps are to be C r where if r = 0 we just mean 
continuous. 

“(7°— diffeomorphism” = C °— isomorphism = homeomorphism 

“C 0 — manifold” = Hausdorff space 

C-group = topological group 


®Notice the font differences. 
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Figure 2.2: The line covers the circle. 


In this section we recall a few facts about the fundamental group and covering 
spaces. In order to unify the presentation let us agree that “ C r diffeomorphism” 
just means homeomorphism in case r = 0. Of course a C° map is just a 
continuous map. Also, much of what we do for C° maps works for more general 
topological spaces and so the word “manifolds” could be replaced by topological 
space although the technical condition of being “locally simply con?nected” 
(LSC) is sometimes needed. All manifolds are LSC. 

We may define a simple equivalence relation on a topological space by declar- 
ing 

p ~ q <t=> there is a continuous curve connecting p to q. 

The equivalence classes are called path components and if there is only one such 
class then we say that M is path connected. The following exercise will be used 
whenever needed without explicit mention: 

Exercise 2.5 The path components of a manifold are exactly the connected 

components of M . Thus, a manifold is connected if and only if it is path con- 

nected. 

Definition 2.17 Let M and M be C r — spaces. A surjective C r map p : M —> 
M is called a C r covering map if every point p £ M has an open connected 
neighborhood U such that each connected component Ui of p -1 (f7) is C r dif- 
feomorphic to U via the restriction p|p : Ui — > U. We say that U is evenly 

covered. The triple (M, p, M) is called a covering space. We also refer to the 

space M (somewhat informally) as a covering space for M. 

We are mainly interested in the case where the spaces and maps are smooth. 
In this case we call M (informally) a covering manifold. 

Example 2.16 The map R. — » S 1 given by t i— > e lt is a covering. If e lS £ S 1 
(0 < 6 < 2i r) then the points of the form {e lt : 9 — n < t, < 6 + it} is an open 
set evenly covered by the intervals I n in the real line given by I n := (9 — tt + 
n2ir, 9 + n + n2n) . 
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Exercise 2.6 Explain why the map (—2ir,2n) — > S' 1 given by t i— > e lt is not a 
covering map. 

The set of all C r covering spaces are the objects of a category. A morphism 
between covering spaces, say (Mi, pi, Mi) and (M 2 ,p 2 ,M 2 ) is a pair of maps 
(/, /) which give a commutative diagram 

Mi 4 M 2 

I I 

Mi 4 M 2 

which means that / o pi = p 2 o / . Similarly the set of coverings of a fixed space 
M are the objects of a category where the morphisms are maps $ : Mi — > M 2 
required to make the following diagram commute: 

Mi M 2 

I I 

M ^ M 

so that pi = p 2 o <h. Now let (M, p, M) be a C r covering space. The set of all 
C r — diffeomorphisms $ which are automorphisms in the above category; that 
is, diffeomorphisms for which p = po <f>, are called deck transformations. A 
deck transformation permutes the elements of each fiber p^ x (p). In fact, it is 
easy to see that if U C M is evenly covered then <f> permutes the connected 
components of p -1 (Z 7). 

Proposition 2.1 If p : M — ► M is a C r covering map with M being connected 
then the cardinality of p~ 1 (p) is either infinite or is independent of p. In the 
latter case the cardinality of p _1 (p) is the multiplicity of the covering. 

Proof. Fix fc < oo. Let Uk be the set of all points such that p _1 (p) has 
cardinality k. It is easy to show that Uk is both open and closed and so, since 
M is connected, Uk is either empty or all of M. m 

Definition 2.18 Let a : [0,1] — * M and (3 : [0,1] — » M be two continuous 
(or C r ) maps (paths) both starting at p £ M and both ending at. q. A fixed 
end point ( C r ) homotopy from a to (3 is a family of maps H s : [0, 1] — > M 
parameterized by s £ [0, 1] such that 

1) H : [0, 1] x [0, 1] — > M is continuous (or C r ) where H(t , s) := H s (t), 

2) Hq = a and Hi = (3, 

3) H s { 0) = p and H s ( 1) = q for all s £ [0, 1]. 

Definition 2.19 If there is a [C r ) homotopy from a to (3 we say that, a is 
homotopic to (3 and write a ~ (3 (C r ). 
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It turns out that every continuous path on a C r manifold may be uniformly 
approximated by a C r path. Furthermore, if two C r paths are continuously 
homotopic then they are C r homotopic. Thus we may use smooth paths and 
smooth homotopies whenever convenient. 

It is easily checked that homotopy is an equivalence relation. Let P(p, q) 
denote the set of all continuous paths from p to q defined on [0,1]. Every 
a € P(p,q) has a unique inverse path a defined by 

:= a(l — t). 

If Pi,P 2 and P 3 are three points in M then for a € P{p\,Pi) and [3 £ P(P 2 ,P 3 ) 
we can “multiply” the paths to get a path a * [3 £ P(pi,P 3 ) defined by 

( a(2 1) for 0 < t < 1/2 

a*f3(t) := < 

[ (3{2t - 1) for 1/2 < t < 1 

An important observation is that if ai ~ a 2 and (3\ ~ (32 then 

oq * (3\ ~ «2 * /?2 where the homotopy between aq * (3\ and ct 2 * /?2 is given 
in terms of the homotopy H a : aq ~ a 2 and Hp : (3\ ~ /3 2 by 

r H a (2t,s) for 0 < t < 1/2 
H(t, s ) := < and 0 < s < 1 

[ Hp(2t — 1, s) for 1/2 < t < 1 

Similarly, if oq ~ 02 then ~ . Using this information we can define a 

group structure on the set of homotopy equivalence classes of loops, that is, of 
paths in P(p,p) for some fixed p £ M. First of all, we can always form a * (3 for 
any a, (3 £ P(p,p) since we are always starting and stopping at the same point 
p. Secondly we have the following 

Proposition 2.2 Let 7Ti(M,p) denote the set of fixed end point homotopy classes 
of paths from P(p,p). For [a], [(3} € 7 Ti(M, p) define [a] • \(3\ := [a * (3\. This 
is a well define multiplication and with this multiplication 7 Ti(M, p) is a group. 
The identity element of the group is the homotopy class 1 of the constant map 
l p : t — > p, the inverse of a class [a] is [a^] . 

Proof. We have already shown that [a] • [(3} := [a * (3 ] is well defined. One 
must also show that 

1) For any a, the paths a o and cW o a are both homotopic to the 
constant map l p . 

2) For any a G P(p,p) we have l p * a ~ a and a * l p ~ a. 

3) For any a, (3, 7 £ P(p,p) we have (a * (3) * 7 ~ a * ((3 * 7 ). 

The first two of these are straight forward and left as exercises. L??L ■ 

The group tt\{M,p) is called the fundamental group of M at p. If 7 : 
[0, 1] — > M is a path from p to q then we have a group isomorphism 717 (M, q) — > 
7 r i(M,p) given by 

[a] 1— > [7 * a * 7 <- ]. 

(One must check that this is well defined.) As a result we have 
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Proposition 2.3 For any two points p,q in the same path component of M, 
the groups tt\ (M, p) and ni(M,q) are isomorphic (by the map described above). 

Corollary 2.1 If M is connected then the fundamental groups based at different 
points are all isomorphic. 

Because of this last proposition, if M is connected we may refer to the 
fundamental group of M. 

Definition 2.20 A path connected topological space is called simply connected 
if M (M) = { 1 }. 

The fundamental group is actually the result of applying a functor. To every 
pointed space ( M,p ) we assign the fundamental group 717 (M,p) and to every 
base point preserving map (pointed map) / : ( M,p ) — * (TV, f(p)) we may assign 
a group homomorphism 7Ti (/) : iri(M,p) — > ni(N, f(p)) by 

7r i(/)( M) = [f ° a \- 

It is easy to check that this is a covariant functor and so for composable pointed 
maps (M, x) -4 ( N,y ) ( P,z ) we have 717(50 f) = tti (5)71-1 (/). 

Notation 2.2 To avoid notational clutter we will denote 717 (/) by /#. 

Theorem 2.2 Every connected manifold M has a simply connected covering 
space. Furthermore, if H is any subgroup of tti(M,p), then there is a connected 
covering p : M — > M and a point p £ M such that p#(iti(M ,p)) = H. 


Definition 2.21 Let f : P M be a C r —map. A map f : P — > M is said to 
be a lift of the map f if po f = f . 

Theorem 2.3 Let p : M —> M be a covering ofC r manifolds and 7 : [a, b ] — > M 
a C r — curve and pick a point y in p _1 ( 7(a))- Then there exist a unique C r lift 
7 : [a, b] — > M of 7 such that 7(a) = y. Thus the following diagram commutes. 

M 

7 

/ Ip 

[a, b\ —> M 

Similarly, if h : [a, b] x [c,d\ — > M is a C r —map then it has a unique lift 
h : [a, b] x [c, d\ — > M . 

Proof. Figure ?? shows the way. Decompose the curve 7 into segments 
which lie in evenly covered open sets. Lift inductively starting by using the 
inverse of p in the first evenly covered open set. It is clear that in order to 
connect up continuously, each step is forced and so the lifted curve is unique. 
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The proof of the second half is just slightly more complicated but the idea is 
the same and the proof is left to the curious reader. A tiny technicality in either 
case it the fact that for r > 0 a C r — map on a closed set is defined to mean that 
there is a C r — map on a slightly larger open set. For instance, for the curve 7 
we must lift an extension y ext : (a — s, b+e) — » M but considering how the proof 
went we see that the procedure is the same and gives a C r — extension 7 ext of 
the lift 7 . ■ 

There are two important corollaries to this result. The first is that if a,/3 : 
[0,1] — > M are fixed end point homotopic paths in M and 5 ~ /? are lifts with 
5(0) = (3{ 0) then any homotopy h t : a ~ (3 lifts to a homotopy h t : 5 ~ (3. The 
theorem then implies that 5(1) = /3(1). From this one easily prove the following 

Corollary 2.2 For every [a] £ iri(M,p) there is a well defined map [a]j : 
p _ 1 (p) — * p _ 1 (p) given by letting [a]jj/ be 5(1) for the lift of a with 5(0) = y. 
(Well defined means that any representative o! £ [a] gives the same answer.) 

Now recall the notion of a deck transformation. Since we now have two ways 
to permute the elements of a fiber, on might wonder about their relationship. 
For instance, given a deck transformation <F and a chosen fiber p^ : (p) when do 
we have $| p -i(p) = [a] # for some [a] £ ni(M,p)? 

2.6.2 Discrete Group Actions 

Let G be a group and endow G with the discrete topology so in particular every 
point is an open set. In this case we call G a discrete group. If M is a topological 
space then so is G x M with the product topology. What does it mean for a 
map a : G x M —> M to be continuous? The topology of G x M is clearly 
generated by sets of the form S x U where S is an arbitrary subset of G and 
U is open in M. The map a : G x M — > M will be continuous if for any point 
(9o> x o) G G x M and any open set U C M containing a(go,Xo) we can find an 
open set S x V containing (go,Xo) such that a(S x V) C U. Since the topology 
of G is discrete, it is necessary and sufficient that there is an open V such that 
a(go x V) C U. It is easy to see that a necessary and sufficient condition for a 
to be continuous on all of G x M is that the partial maps a g (.) := a(g, .) are 
continuous for every g £ G. 

Definition 2.22 Let G and M be as above. A (left) discrete group action is a 
map a : G x M — > M such that for every g £ G the partial map a g (.) := a(g, .) 
is continuous and such that the following hold: 

1 ) a(g 2 , a(gi,x)) = ot(g 2 gi, x) for all gi,g 2 £ G and all x £ M. 

2) a(e, x) = x for all x £ M . 

It follows that if a : G x M — > M is a discrete action then each partial map 
a g (.) is a homeomorphism with a~ 1 (.) = a g -i(.). It is traditional to write g ■ x 
or just gx in place of the more accurate a(g,x). Using this notation we have 
g- 2 (gix) = {g 29 i)x and ex = x. 
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Definition 2.23 A discrete group action is C r if M is a C r manifold and each 
a g (.) is a C r map. 

If we have a discrete action a : G x M — > M then for a fixed x, the set 
G • x := {g ■ x : g £ G} is called the orbit of x. It is easy to see that two orbits 
G ■ x and G • y are either disjoint or identical. In fact, we have equivalence 
relation on M where x ~ y iff there exists a g £ G such that gx = y. The 
equivalence classes are none other than the orbits. The natural projection onto 
set of orbits p : M — > M/G given by 


xi —> G • x. 


If we give M/G the quotient topology then of course p is continuous but more 
is true: The map p : M — > M/G is an open map. To see this notice that if 
U C M and we let U := p(U) then p~ 1 ( U) is open since 

p- 1 (U) = \J{gU:geG} 

which is a union of open sets. Now since p _1 ({ 7) is open, U is open by definition 
of the quotient topology. 

Example 2.17 Let <f> : M — > M be a diffeomorphism and let Z act on M by 
n ■ x := (f> n (x ) where 

<t>° := idj ur , 

</>":=</ o • • • o (f> for n > 0 
<f>~ n := (0 -1 )" for n > 0. 

This gives a discrete action ofZ on M. 

Definition 2.24 A discrete group action a : G x M — > M is said to act prop- 
erly if for every for every x € M there is an open set U C M containing x such 
that unless g = e we have gU n U = 0 for all j/e. We shall call such an open 

set self avoiding. 

It is easy to see that if U C M is self avoiding then any open subset V C U 
is also self avoiding. Thus every point x £ M has a self avoiding neighborhood 
that is connected. 

Proposition 2.4 If a : G x M —> M is a proper action and U C M is self 
avoiding then p maps gU onto p{U) for all g £ G and the restrictions p\ gU : 
gU — > p(U) are homeomorphisms. In fact, p : M — > M/G is a covering map. 

Proof. Since U = gU via x i— > gx and since x and gx are in the same orbit, 
we see that gU and U both project to same set p(U). Now if x,y £ gU and 
P\ g u ( x ) = P\gU (v) th en y = hx for some h £ G. But also x = ga (and y = gb) 
for some a,b £ U. Thus h~ 1 gb = x so x £ h~ x gU . On the other hand we also 
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know that x € gU so h~ 1 gU fl gU yf 0 which implies that g~ 1 h~ 1 gU fl U ^ 0. 
Since U is self avoiding this means that g~ 1 h~ 1 g = e and so h = e from which 
we get y = x. Thus p \ gU : gU — > p(U) is 1-1. Now since p\ gU is clearly onto 
and since we also know that p\ gU is an open continuous map the result follows. 

■ 

Example 2.18 Fix a basis (fi,f 2 ) ofM 2 . Let Z 2 act on R 2 by ( m,n ) • (x,y) := 
(x, y) + mil + ni^. This action is easily seen to be proper. 

Example 2.19 Let Z 2 '■= {1, — 1} act on the sphere by (±1) • x := Ax. Thus 
the action is generated by letting —1 send a point on the sphere to its antipode. 
This action is also easily seen to be proper. 

Exercise 2.7 (!) Let a : G x M — > M act properly on M. Show that if U\ 
and U 2 are self avoiding and p{U\) flp(t/ 2 ) yf 0 then there is a g £ G such that 
a g(Ui) fl U 2 yf 0- Show also that if a g is C r then a g maps U\ fl a g -i (C/ 2 ) := 0\ 
diffeomorphically onto a g {Ui) fl U 2 '■= O 2 (homeomorphically if r = 0) and in 
this case 

a s I 01 = p\o\°p\o 1 ■ 

Hint: If U\ and U 2 are self avoiding then so are 0\ and O 2 and p(0 1 ) = 

p(o 2 ). 

Proposition 2.5 Let a : G x M — > M act properly by C r -diffeomorphisms on 
a C r - manifold M. Then the quotient space M/G has a natural C r structure 
such that the quotient map is C r local diffeomorphism. The quotient map is a 
covering map 

Proof. We exhibit the atlas on M/G and then let the reader finish the 
(easy) proof. Let Am be an atlas for M. Let x = Gx be a point in M / G and 
pick an open U C M which contains a point, say x, in the orbit Gx which (as a 
set) is the preimage of x and such that unless g = e we have gUfHJ = 0. Now let 
U a , x Q be a chart on M containing x. By replacing U and U a by U fl U a we may 
assume that x a is defined on U = U a . In this situation, if we let U* := p(U) 
then each restriction p\ lT : U — » U* is a homeomorphism. We define a chart 
map x* with domain U* by 

x* := o p\~l : U* - R". 

Let x* and x^ be two such chart maps with domains U* and C/g. IfC/*n/7gy^0 
then we have to show that x^ o (x* ) _1 is a C r diffeomorphism. Let x £ U*nU^ 
and abbreviate U*p = U*nUp. Since U* fl Up ^ 0 there must be a g £ G such 
that a g (U a ) fl Up ^ 0. Using exercise 2.7ancl letting a g (U a ) fl Up := 0 2 and 
0\ := a g -i 02 we have 

* * I — 1 

x /3 0 x al 

= x d ° p\ol ° ( x « ° pioly 1 

= x /3 0 Plot ° P\o! 0 X a X 

= xpoa g o x" 1 
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which is C r . The rest of the proof is straight forward and is left as an exercise. 

Remark 2.5 In the above, we have suppressed some of information about do- 
mains. For example, what is the domain and range of xp o p\ Q * o p\ 0i o x” 1 ? 
For completeness and in order to help the reader interpret the composition we 
write out the sequence of domains: 

x a (u a n eig-i Up) — > u a n cig-iUp 

^u*nu;^ a g (u a ) nu 0 ^ x p {a g {u a ) n Up). 

The clutter hides a simple idea that would be better expressed using the idea of 
a chart germ: 

Let Xo £ U*nUp and consider the composition of germs of diffeomorphisms 
x*p o (x*) _1 = xpop - 1 o (x Q op ” 1 )- 1 =xpoa g ox' 1 , 


Example 2.20 We have seen the torus as a differentiable manifold previously 
presented as T 2 = S 1 x S 1 . Another presentation that emphasizes the symme- 
tries is given as follows: Let the group Z x Z = Z 2 act on R 2 by 

(to, n) x (x, y) e- > (x + to, y + n). 

It is easy to check that proposition 2.5 applies to give a manifold R 2 /Z 2 . This 
is actually the torus and we have a diffeomorphism </> : R 2 /Z 2 — » S 1 x S 1 = T 2 
given by [(x,j/)] e- > ( e lx ,e ly ). The following diagram commutes: 

R 2 S 1 x S 1 

I / 

R 2 /Z 2 

Covering spaces p : M —> M that arise from a properly discontinuous group 
action are special in that if M is connected then the covering is a normal 
cover?ing. 


2.7 Grassmannian manifolds 

A very useful generalization of the projective spaces is the Grassmannian mani- 
folds. Let G Ui k denote the set of fc-dimensional subspaces of R n . We will exhibit 
a natural differentiable structure on this set. The idea here is the following. An 
alternative way of defining the points of projective space which is as equivalence 
classes of n— tuples (n 1 , ..., v n ) € R” — {0} where (u 1 , ..., v n ) ~ (Ac 1 , ..., Ac”) for 
any nonzero. This is clearly just a way of specifying a line through the origin. 
Generalizing, we shall represent a k— plane as a matrix whose column vectors 
span the k— plane. Thus we are putting an equivalence relation on the set of 
n x k matrices where A ~ Ag for any nonsingular k x k matrix g. 
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To describe this important example we start with the set M„ X fc of n x k 
matrices with rank k < n (maximal rank). The columns of each matrix from 
Mjjxfe span a fc-dimensional subspace of M n . Define two matrices from M nX fc 
to be equivalent if they span the same fc-dimensional subspace. Thus the set 
G(k, n) of equivalence classes is in one to one correspondence with the set of 
real k dimensional subspaces of 

Now let U be the set of all [A] £ G(k, n) such that A has its first k rows 
linearly independent. This property is independent of the representative A 
of the equivalence class [A] and so U is a well defined set. This last fact is 
easily proven by a Gaussian column reduction argument. Now every element 
[A] £ U C G(k,n) is an equivalence class that has a unique member A 0 of the 
form 



Thus we have a map on U defined by T : [A] i— > Z £ M( n _k) x *. = R fc ( n-fe ). We 
wish to cover G(k, n) with sets U a similar to U and defined similar maps. Let 
(Tji ...ifc be the shuffle permutation that puts the k columns indexed by i i, 
into the positions 1 without changing the relative order of the remaining 
columns. Now consider the set Ui 1 ...i k of all [A] € G(k,n) such that any repre- 
sentative A has its k rows indexed by i\, ..., ik linearly independent. The the per- 
mutation induces an obvious 1-1 onto map cr i;L ifc from Ui k ...i k onto U = 

We now have maps : U il „, ik — > M( n _ fe ) xfc = R fc ( n-fc ) given by composi- 
tion := ’FotxqU/. These maps form an atlas f/q. for G(k, n) 

which turns out to be a holomorphic atlas (biholomorphic transition maps) and 
so gives G(k , n) the structure of a smooth manifold called the Grassmannian 
manifold of real /c-planes in R n . Try?graph coordinates! 


2.8 Partitions of Unity 

A partition of unity is a technical tool that is used quite often in connection 
with constructing tensor fields, connections, metrics and other objects out of 
local data. We will not meet tensor fields for a while and the reader may wish 
to postpone a detailed reading of the proofs in this section until we come to our 
first use of partitions of unity and/or so called “bump functions”. Partitions 
of unity are also used in proving the existence of immersions and embeddings; 
topics we will also touch on later. 

It is often the case that we are able to define some object or operation locally 
and we wish to somehow “glue together” the local data to form a globally defined 
object. The main and possible only tool for doing this is the partition of unity. 
For differential geometry it is a smooth partition of unity that we need. 

Definition 2.25 The support of a smooth function is the closure of the set in 
its domain where it takes on nonzero values. The support of a function f is 
denoted supp(/). 
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One of the basic ingredients we will need is the so called “bump function” 
a special case of which was defined in section 7. A bump function is basically a 
smooth function with support inside some prescribed open set. Notice that this 
would not in general be possible for a complex analytic function. 

Lemma 2.1 (Existence of bump functions) Let K be a compact subset of 
R" and U an open set containing K . There exists a smooth function (3 on R" 
which is identically equal to 1 on K, has compact support in U and such that 
0 < /3 < 1. 


Proof. Special case: Assume that U = 13(0,1?.) and K 
case we may take 


<t>(x) 


S\x | 9(*)dt 


where 



0 


if 0 < t < R 
otherwise. 


13(0, r). In this 


General case: Let K C U be as in the hypotheses. Let K % C Ui be 
concentric balls as in the special case above but with various choices of radii 
and such that K C Ulf., and the Ui chosen small enough that Ui C U. Let </>j 
be the corresponding functions provided in the proof of the special case. By 
compactness there are only a finite number of pairs Ki C U t needed so assume 
that this reduction to a finite cover has been made. Examination of the following 
function will convince the reader that it is well defined and provides the needed 
bump function; 

(3(x) = 1 - PJ(1 - 4>i{x)). 

i 


A refinement of an open cover {Up }/} of a topological space is another 
open cover such that every open set from the second cover is contain 
in at least one open set from the original cover. This means that means that 
if [Ufj}p e b is the given cover of X, then a refinement is a cover {Vi },<=/ and a 
set map / — » 13 of the index sets i i— > (3{i) such that Vi C Up^y We say that 
{Vi}i£i is a locally finite cover if in every point of X has a neighborhood that 
intersects only a finite number of the sets from {V)}i 6 /. In other words, the 
every V, is contained in some U g and the new cover has only a finite number of 
member sets that are “near” any given point. 


Definition 2.26 A topological space X is called paracompact if it is Hausdorff 
and if every open cover of X has a refinement to a locally finite cover. 

Definition 2.27 A base (or basis) for the topology of a topological space X is 
a collection of open 2} sets such that all open sets from the topology T are unions 
of open sets from the family 03. A topological space is called second countable 
if its topology has a countable base. 
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Definition 2.28 A topological space is called locally convex if every point has 
a neighborhood with compact closure. 

Note that a finite dimensional differentiable manifold is always locally com- 
pact and we have agreed that a finite dimensional manifold should by assumed 
Hausdorff unless otherwise stated. The following lemma is sometimes helpful. 
It shows that we can arrange to have the open sets of a cover and a locally 
refinement of the cover to be indexed by the same set in a consistent way: 

Lemma 2.2 If X is a paracompact space and is an open cover, then 

there exists a locally finite refinement with Oi C Ui . 

Proof. Let {Vkji^K be a locally finite refinement of with the index 

map k i— > i(k). Let Oi be the union of all Vj- such that i(k) = k. Notice that 
if an open set U intersects an infinite number of the Oi then it will meet an 
infinite number of the V/ c . It follows that is locally finite. ■ 

Theorem 2.4 A second countable, locally compact Hausdorff space X is para- 
compact. 

Sketch of proof. If follows from the hypotheses that there exists a sequence 
of open sets U\, f/ 2 , ....which cover X and such that each Ui has compact closure 
Ui. We start an inductive construction: Set V n = U\ U Ui U ... U U n for each 
positive integer n. Notice that {V^} is a new cover of X and each V n has 
compact closure. Now let 0\ = V t . Since {V n } is an open cover and 0\ is 
compact we have 

^C^U^U.-.U^. 

Next put Oi = V k U Vi 2 U ... U V lk and continue the process. Now we have the 
X is the countable union of these open sets {Oi} and each Oi- 1 has compact 
closure in O,;. Now we define a sequence of compact sets; Ki = Oi \ . 

Now if {Wp}p e B is any open cover of X we can use those Wp which meet K, 
to cover Ki and then reduce to a finite subcover since Ki is compact. We can 
arrange that this cover of Ki consists only of sets each of which is contained 
in one of the sets Wp fl Ch+i and disjoint from Ot-\. Do this for all Ki and 
collect all the resulting open sets into a countable cover for X. This is the desired 
locally finite refinement. ■ 

Definition 2.29 A C r partition of unity on a C r manifold M is a collection 
{Vi, pi] where 

1. {Vi} is a locally finite cover of M; 

2. each pi is a C r function with pi > 0 and compact support contained in Vp 

3. for each x £ M we have Pi{x) = 1 (This sum is finite since {Vi} is 
locally finite.) 
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If the cover of M by chart map domains {U a } of some atlas A = {U a ,x a } 
of M has a partition of unity {Vi, pi} such that each Vi is contained in one of 
the chart domains U a u ) (locally finite refinement), then we say that {Vi, pi} is 
subordinate to A. We will say that a manifold admits a smooth partition 
of unity if every atlas has a subordinate smooth partition of unity. 

Notice that in theorem 2.4 we have proven a bit more than is part of the 
definition of paracompactness. Namely, the open sets of the refinement V) C 
17g(i) have compact closure in Upuy 

Theorem 2.5 Every second countable finite dimensional C r manifold admits 
a C r partition of unity. 

Let M be the manifold in question. We have seen that the hypotheses imply 
paracompactness and that we may choose our locally finite refinements to have 
the compact closure property mentioned above. Let A = {Ui, x,} be an atlas for 
M and let { Wi } be a locally finite refinement of the cover {Ui} with W, C U- t . 
By lemma 2.1 above there is a smooth bump function j3i with supp(/3j) = W j. 
For any x € M the following sum is finite and defines a smooth function: 

Pi x ) = 

i 

Now we normalize to get the needed functions that from the partition of unity: 


Pi = 


fa 

P' 


It is easy to see that pi > 0, and Pi = 1- 


2.9 Manifolds with boundary. 

For the general Stokes theorem where the notion of flux has its natural setting 
we will need to have a concept of a manifold with boundary . A basic example 
to keep in mind the closed hemisphere Si which is the set of all (. x,y,z ) G S 2 
with z > 0. 

Let A € R™* be a continuous from on aEuclidean space R". In the case of 
R™ it will be enough to consider projection onto the first coordinate x 1 . Now 
let R^ + = {x G R": A(®) > 0} and RJ" = {x G R”: A(s) < 0} and 9Rf = 3 
R" _ = {x G R": A(x) = 0} is the kernel of A. Clearly R)( + and R" _ are 
homeomorphic and d R^ + is a closed subspace. The space R? - is the model 
space for a manifold with boundary and is called a (negative) half space. 

Remark 2.6 We have chosen the space R" _ rather than R)( - on purpose. The 
point is that later we will wish to have simple system whereby one of the coordi- 
nate vectors will always be outward pointing at d R" - while the remaining 
coordinate vectors in their given order are positively oriented on d R" _ in a 
sense we will define later. Now, is outward pointing for R”i< 0 but not for 
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R”i >0 . One might be inclined to think that we should look at K" j>0 for some 
other choice of j - the most popular being the last coordinate x n but although this 
could be done it would actually only make things more complicated. The prob- 
lem is that if we declare g|r, g§ 2 , g ± to be positively oriented on R n_1 x 0 
whenever q §r, ..., is positively oriented on R™ we will introduce a minus 

sign into Stokes’ theorem in every other dimension! 

We let R"“ (and R" + ) have the relative topology. Since R"“ C R" we 
already have a notion of differentiability on R" _ (and hence R" + ) via definition 
??. The notions of C r maps of open subsets of half space and diffeomorphisms 
etc. is now defined in the obvious way. For convenience let us define for an open 
set U C R? _ (always relatively open) the following slightly inaccurate notations 
let dU denote R"~ n U and int(U) denote U \ dU. 

We have the following three facts: 

1. First, it is an easy exercise to show that if / : U C R" — > R d is C r 
differentiable (with r > 1) and g is another such map with the same 
domain, then if / = g on R^ _ fl U then D x f = D x g for all x € R^ _ D U. 

2. Let R)? + be a half space in aEuclidean space R d . If / : U C R n — > Rf _ 
is C r differentiable (with r > 1) and /(x) € d R^ _ then D x f : R" — » R d 
must have its image in d R^ _ . 

3. Let f : Ui C R" _ — > U 2 C R^ _ be a diffeomorphism (in our new extended 
sense). Assume that R"~ fl U\ and R^ _ fl U 2 are not empty. Then / 
induces diffeomorphisms dU\ — > dU 2 and int{U\) — > int(U 2 ). 

These three claims are not exactly obvious but there are very intuitive. On 
the other hand, none of them are difficult to prove and we will leave these 
for exercises (actually the proof of 3 is more or less obvious from the proof of 
theorem). These facts show that the notion of a boundary defined here and in 
general below is a well defined concept and is a natural notion in the context of 
differentiable manifolds; it is a “differentiable invariant”. 

We can now form a definition of manifold with boundary in a fashion com- 
pletely analogous to the definition of a manifold without boundary. A half 
space chart x a for a set M is a bijection of some subset U a of M onto an 
open subset of R?~ (or R" + for many authors). A C r half space atlas is a 
collection (x a , U a ) of such half space charts such that for any two, say (x a , U a ) 
and (x^, Up), the map x a o x^ 1 is a C r diffeomorphism on its natural domain 
(if non-empty). Note: “Diffeomorphism” means in the extended the sense of a 
being homeomorphism and such that both x a ox)' :: R"~ — > R™ and its inverse 
are C r in the sense of definition 2.5. 

Example 2.21 Review the section on pseudogroups were we defined manifold 
with boundary in that context. Is the definition there the same as that given 
here? 2.f 
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Overlap of boundary charts. 

Definition 2.30 A C r -manifold with boundary M,A is a set M together 
with a maximal atlas of half space charts A. The manifold topology is that 
generated by the domains of all such charts. The boundary of M is denoted by 
dM and is the set of point which have images in the boundary Mg o/M" _ under 
some and hence every chart. 

Definition 2.31 The interior of a manifold with boundary is a manifold with- 

O O 

out boundary and is denoted M. The manifold M is never compact and is 
referred to as an open manifold. 

Definition 2.32 In our present context, a manifold without boundary which is 
compact (and hence closed in the usual topological sense if M is Hausdorjf) is 
called a closed manifold. If no component of a manifold without boundary is 
compact it is called an open manifold. 

Remark 2.7 The phrase “closed manifold ” is a bit problematic since the word 
closed is acting as an adjective and so conflicts with the notion of closed in the 
ordinary topological sense. For this reason we will try to avoid this terminology 
and use instead the phrase “compact manifold without boundary'’ . 

Exercise 2.8 Show that M U dM is closed and that M — dA I is open. 

Remark 2.8 (Warning) Some authors let M denote the interior, so that M U 
dM is the closure and is the manifold with boundary in our sense. 
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Theorem 2.6 dAI is a C r manifold (without boundary) with an atlas being 
given by all maps of the form x a | , U a fl dM. The manifold dM is called the 

boundary of M. 

Idea of Proof. The truth of this theorem becomes obvious once we recall 
what it means for a chart overlap map y o x~ 1 : U — > V to be a diffeomorphism 
in a neighborhood a point x £ Ufl R” + . First of all there must be a set U' 
containing U which is open in R" and an extension of yox” 1 to a differentiable 
map on U' . But the same is true for (y o x -1 ) -1 = x o y _1 . The extensions are 
inverses of each other on U and V. But we must also have that the derivatives 
maps of the transition maps are isomorphisms at all points up to and including 
dU and dV. But then the inverse function theorem says that there are neigh- 
borhoods of points in dU in R™ and dV in R" such that these extensions are 
actually diffeomorphisms and inverses of each other. Now it follows that the 
restrictions yox -1 | ac/ : dU — > dV are diffeomorphisms. In fact, this is the 
main point of the comment (3) above and we have now seen the idea of its proof 
also. ■ 

Example 2.22 The closed ball B(p,R) in R™ is a manifold with boundary 
dB(p,R) = S n ~ 1 . 

Example 2.23 The hemisphere S™ = {x € R” +1 : x n+1 > 0} is a manifold 
with boundary. 

Exercise 2.9 Is the Cartesian product of two manifolds with boundary a man- 
ifold with boundary? 



Chapter 3 

The Tangent Structure 


3.1 Rough Ideas II 

Let us suppose that we have two coordinate systems x = (a; 1 , a; 2 , x n ) and 

y = (y 1 ,y 2 , y n ) defined on some common open set of a differentiable manifold 

M as defined above in 26.88. Let us also suppose that we have two lists of 

numbers v 1 , v 2 , ..., v n and tr,5, v n somehow coming from the respective 

coordinate systems and associated to a point p in the common domain of the 
two coordinate systems. Suppose that the lists are related to each other by 


v = 


E 


k = 1 


dx l _ k 

d^ V 


where the derivatives are evaluated at the coordinates y 1 {p), y 2 (p), y n (p). 
Now if / is a function also defined in a neighborhood of p then the representative 
functions for / in the respective systems are related by 


df_ 

dx i 


y, dx k df 

^ dx l dx k 
fc= 1 


The chain rule then implies that 


dx iV dx iV ' 

Thus if we had a list v l ,v 2 , ...,v n for every coordinate chart on the manifold 
whose domains contain the point p and related to each other as above then we 
say that we have a tangent vector v at p £ M. It then follows that if we define 
the directional derivative of a function f at p in the direction of v by 


vf := 
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then we are in business since it doesn’t matter which coordinate system we use. 
Because of this we think of (v l ) and (t>*) as representing the same geometric 
object (a tangent vector at p). Where do we get such vectors in a natural way? 
Well one good way is from the notion of the velocity of a curve. A differentiable 
curve though p £ M is a map c : (—a, a) — > M with c(0) = p such that the 
coordinate expressions for the curve x l {t) = (i'o c)(t) are all differentiable. We 
then take 


v 


l 


dx l 

dt 


(0) 


for each coordinate system x = (x 1 , x 2 , x n ) with p in its domain. This gives a 

well defined tangent vector v at p called the velocity of c at t = 0. We denote this 
by c'(0) or by ^|(0).Of course we could have done this for each t £ (—a, a) by 
defining v l (t) := and we would get a smoothly varying family of velocity 

vectors c' (t) defined at the points c(t) £ M. 

If we look at the set of all tangent vectors at a point p £ M we get a vector 
space since we can clearly choose a coordinate system in which to calculate and 
then the vectors just appear as n— tuples; that is, elements of R n . The vector 
space operations (scaling and vector addition) remain consistently related when 
viewed in another coordinate system since the relations are linear. The set of 
all tangent vectors at p £ M is called the tangent space at p. We will denote 
these by T p M for the various points p in M. The tangent spaces combine to 
give another differentiable manifold of twice the dimension. The coordinates 
come from those that exist on M already by adding in the “point” coordinates 
(x 1 , ....,x n ) the components of the vectors. Thus the coordinates of a tangent 
vector v at p are simply (x 1 , ..., x n \ ir , ...., v n ). 


3.2 Tangent Vectors 

For a submanifold S of R n we have a good idea what a tangent vector ought 
to be. Let t i— > c(t) = (s 1 (t), ..., x n (t)) be a curve with image contained in 
S and passing through the point p £ S at time t = 0. Then the vector v = 
c(t) = | Q c(t) is tangent to S at p. So to be tangent to S at p is to be the 

velocity at p of some curve in S through p. Of course, we must consider v to 
be based at p £ S in order to distinguish it from parallel vectors of the same 
length that may be velocity vectors of curves going through other points. One 
way to do this is to write the tangent vector as a pair {p, v) where p is the base 
point. In this way we can construct the space TSoi all vectors tangent to S as 
a subset of x R n 

TS = { (p, v) £ R n x R" : p £ S and v tangent to S at p} 

This method will not work well for manifolds that are not given as submanifolds 
of R". We will now give three methods of defining tangent vectors at a point of 
a differentiable manifold. 



3.2. TANGENT VECTORS 


49 


Definition 3.1 (Tangent vector via charts) Consider the set of all admis- 
sible charts (x a ,U a ) a& A on M indexed by some set A for convenience. Next 
consider the set T of all triples (p,v,a) such that p £ U a . Define an equivalence 
relation so that (p,v,a) ~ (q, w, /?) iff p = q and 

L>(x / 3 ° x - 1 )| x(p) - v = w 

In other words, the derivative at x(p) of the coordinate change xp o x” 1 “iden- 
tifies” v with w. Tangent vectors are then equivalence classes with the tangent 
vectors at a point p being those equivalence classes represented by triples with 
first slot occupied by p. The set of all tangent vectors at p is written as T p M. 
The tangent bundle TM is the disjoint union of all the tangent spaces for all 
points in M. 

TM := |J T p M 

peM 


This viewpoint takes on a more familiar appearance in finite dimensions if 
we use a more classical notation; Let (x, U) and (y, V) two charts containing 
p in there domains. If an n— tuple (v 1 , ...,v n ) represents a tangent vector at p 
from the point of view of (x, U) and if the n— tuple (w 1 , ...,w n ) represents the 
same vector from the point of view of (y, V) then 


u i = Y fW 

' dxi 


j=i 


x(p) 


where we write the change of coordinates as y‘ = y l (x , ...,x n ) with 1 < i < n. 
We can get a similar expression in the infinite dimensional case by just letting 


% ox )| x(p) be denoted by X 


dy 


x(p) 


then we write 


<9y 

dx 


dp) 


Definition 3.2 (Tangent vectors via curves) Letp be a point in aC r man- 
ifold with k > 1. Suppose that we have C r curves ci and C 2 mapping into man- 
ifold M , each with open domains containing 0 £ i and with Ci(0) = C2(0) = p. 
We say that c\ is tangent to C 2 at p if for all C r functions f : M — > M we have 
i\ t _ 0 f°ci = i\ t - 0 f° c i- This is an equivalence relation on the set of all such 
curves. Define a tangent vector at p to be an equivalence class X p = [ c } under 
this relation. In this case we will also write c'(0) = X p . The tangent space 
T p M is defined to be the set of all tangents vectors at p £ M. The tangent 
bundle TM is the disjoint union of all the tangent spaces for all points in M . 


TM := |_| T p M 

peM 
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The tangent bundle is actually a differentiable manifold itself as we shall 
soon see. 

If X p £ T p M for p in the domain of an admissible chart ( U a ,x a ). In this 
chart X p is represented by a triple (p, v,a). We denote by [ X p ] a the principle 
part v of the representative of X p . Equivalently, [ X p ] a = D(x a o c)| 0 for any c 
with c'(0) = X p i.e. X p = [c] as in definition 3.2. 

For the next definition of tangent vector we need to think about the set 
of functions defined near a point. We want a formal way of considering two 
functions that agree on some open set containing a point as being locally the 
same at that point. To this end we take the set F p of all smooth functions with 
open domains of definition containing p £ M. Define two such functions to be 
equivalent if they agree on some small open set containing p. The equivalence 
classes are called germs of smooth functions at p and the set of all such is 
denoted F v = F p / It is easily seen that F v is naturally a vector space and 
we can even multiply germs in the obvious way. This makes F v a ring (and an 
algebra over IR). Furthermore, if / is a representative for the equivalence class 
f £ F p then f(p ) = f(p) is well defined and so we have an evaluation map 
ev p : F p — > R. We are really just thinking about functions defined near a point 
and the germ formalism is convenient whenever we do something where it only 
matters what is happening near p. We will thus sometimes abuse notation and 
write / instead of / to denote the germ represented by a function /. In fact, we 
don’t really absolutely need the germ idea for the following kind of definition to 
work so we put the word “germs” in parentheses. 

We have defined F p using smooth functions but we can also define in an 
obvious way Fp using C r functions. 

Definition 3.3 Let f be the germ of a function / :: M-*K. Let us define the 
differential of f at p to be a map df(p) : T p M — > R. by simply composing a 
curve c representing a given vector X p = [c] with f to get /oc::R^R. Then 
define df{p) ■ X p = ^ | /oc£R. Clearly we get the same answer if we use 
another function with the same germ at p. The differential at p is also often 
written as df | . More generally, if f :: M — > E for someEuclidean space E then 
df(p) : TpM — > E is defined by the same formula. 

It is easy to check that df(p) : T p M — > E the composition of the tangent 
map T p f defined below and the canonical map T y E = E where y = /( p). Dia- 
grammatically we have 

df(p) : TpM ^TE=Ex E^E. 

Remark 3.1 (Very useful notation) This use of the “differential” notation 
for maps into vector spaces is useful for coordinates expressions. Let p £ U 
where (x, U) is a chart and consider again a tangent vector v at p. Then the 
local representative of v in this chart, is exactly dx( v) . 

Definition 3.4 A derivation of the algebra F v is a map V : F p — > R. such that 
V(fg) = f(p)^g + g(p)T>f for all f,g£ Fp. 
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Notation 3.1 The set of all derivations on T v is easily seen to be a real vector 
space and we will denote this by Der(jF p ). 

We will now define the operation of a tangent vector on a function or more 
precisely, on germs of functions at a point. 


Definition 3.5 Let T>x p : T v — * K be given by the 

Lemma 3.1 T>x p is a derivation of the algebra T v . 

Kp)' d x p 9 + g(p)h> Xp f- 

A basic example of a derivation is the partial derivative operator | ^ : 
f i— > (xq). We shall show that for a smooth n-manifold these form a basis 

for the space of all derivations at a point Xq € M. This vector space of all 
derivations is naturally isomorphic to the tangent space at Xq. Since this is 
certainly a local problem it will suffice to show this for xq £ R". 


rule V X J = df(p) ■ X p . 
That is we have V\ p (fg) = 


Notation 3.2 In the literature T>x p f is written X p f and we will also use this 
notation. As indicated above, if M is finite dimensional and C°° then all 
derivations of T v are given in this way by tangent vectors. Thus in this case 
(and not for Banach manifolds of lower differentiability) we could abbreviate 
IT x p f — X p f and define tangent vector to be derivations. For this we need a 
couple of lemmas: 

Lemma 3.2 If c is (the germ of) a constant function then Vc = 0. 

Proof. Since T> is linear this is certainly true if c = 0. Also, linearity shows 
that we need only prove the result for c = 1 . Then 

VI = V(l 2 ) 

= {Vl)c + 1V1 = 2V1 


and so VI = 0. ■ 


Lemma 3.3 Let f :: R",a’o — > R, /(x o) be defined and C°° in a neighborhood 
of x o- Then near Xq we have 


f(x) = f(x q) + (A* ~ x o) 


Ki<n 


^-\x 0 )+a\x) 


for some smooth functions a l (x) with a l (x o) = 0. 
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Proof. Write f(x) - f(x 0 ) = f 0 [f(x 0 + t(x - x 0 )] dt = E”=i( a: * “ 
Xq) /p 3c'(x, o + t(x — x 0 ))dt. Integrate the last integral by parts to get 

/ Jx* ~ x o))] dt 


df 

= [Oo + t(x - X 0 ))] 

= J^(x 0 ) + a*(x) 


r>l n 


0 J 0 , =1 


d 2 


dx i dxi 


( Xo + t(x — Xq ))dt 


where the term a l (x) clearly satisfies the requirements. 


Proposition 3.1 Let V Xo be a derivation on T x „ where Xq G R". Then 


v X0 =jy x M) 


dx i 


In particular, V corresponds to a unique vector at Xq and by the association 
(P 2;o (x 1 ), ...,V Xo {x n )) i— > V Xo we get an isomorphism o/R" with Der(jFp). 


Proof. Apply V to both sides of 


/(x) = f(x 0 ) + (x* - x l 0 ) 

l<i<n 


'df_ 

dx l 


(x 0 ) + a l (x) . 


and use 3.2 to get the formula of the proposition. The rest is easy. ■ 

An important point is that the above construction carries over via charts 
to a similar statement on a manifold. The reason for this is that if (x, U) is 
a chart containing a point p in a smooth n manifold then we can define a an 
isomorphism between Der (!F P ) and Der(iF x (p)) by the following simple rule: 

2^x(p) * Up 

V = %)(/°A 1 ). 

The one thing that must be noticed is that the vector (T> (x 1 ), ..., 2? (x n )) 

transforms in the proper way under change of coordinates so that the corre- 
spondence induces a well defined 1-1 linear map between T p M and Der(iFp). So 
using this we have one more possible definition of tangent vectors that works 
on smooth finite dimensional manifolds: 


Definition 3.6 (Tangent vectors as derivations) Let M be a smooth man- 
ifold of dimension n < oo. Consider the set of all (germs of) smooth functions 
T p at p € M. A tangent vector at p is a linear map X p : T v — > R which is 
also a derivation in the sense that for f,gGtF p 

X P {fg) = g(p)X p f + f(p)X p g. 

Once again the tangent space at p is the set of all tangent vectors at p and the 
tangent bundle is define by disjoint union as before. 
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In any event, even in the general case of a C r Banach manifold with r > 1 a 
tangent vector determines a unique derivation written X p : / i— > X p f. However, 
in this case the derivation maps T p to T p ~ Y . Also, on infinite dimensional 
manifolds, even if we consider only the C°° case, there may be derivations not 
coming from tangent vectors as given in definition 3.2 or in definition 3.1. 


3.3 Interpretations 


We will now show how to move from one definition of tangent vector to the 
next. For simplicity let us assume that M is a smooth (C°°) n-manifold. 


1 . 


Suppose that we think of a tangent vector X p as an equivalence class of 
curves represented by c : I — > M with c(0) = p. We obtain a derivation 
by defining 


X p f:= 


d 

dt 


foe 

t=0 


We can define a derivation in this way even if M is infinite dimensional 
but the space of derivations and the space of tangent vectors may not 
match up. 


2. If X p is a derivation at p and f7 Q ,x a = (ad, ..., x n ) an admissible chart 
with domain containing p , then A p ,as a tangent vector as in definition 
3.1, is represented by the triple (p, v, a) where v = (id, ...v 11 ) is given by 


v l = X p x l (acting as a derivation) 


3. Suppose that, a la definition 3.1, a vector X p aX p £ M is represented 
by (p, v, a) where vel" and a names the chart (x a ,U a ). We obtain a 
derivation by defining 


X P f=D(f ox a 1 )| Xa(p) 


In case the manifold if modelled on R" then we have the more traditional 
notation 


* P / = E 



/• 


for v = (u 1 , ...v n ). 


The notation | is made precise by the following: 

Definition 3.7 For a chart x = (ad, ... ,x n ) with domain U containing a point 
p we define a tangent vector (p~r\ p £ T p M by 

f = A(/°x _1 )(x(p)) 
v 


d_ 

dx i 
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Alternatively, we may take -£~r\ p to be the equivalence class of a coordinate 
curve. In other words, is the velocity atx(p) of the curvet i— > x~ 1 (a: 1 (p), ...,x' l (p)+ 

t , ..., x n (p)) defined for sufficiently small t. 

We may also identify gfi | p as the vector represented by the triple (p, e, ,a) 
where e* is the i-th member of the standard basis for R" and a refers to the 
current chart, x = x Q . 

Exercise 3.1 For a finite dimensional C°°— manifold M and p £ M, let x a = 

(ar, x n ), U a be a chart whose domain contains p. Show that the vectors 
g|r | p i ore a basis for the tangent space T P M . 

3.4 The Tangent Map 

The first definition of the tangent map of a map / : M, p — » N,f(p) will be 
considered our main definition but the others are actually equivalent at least 
for finite dimensional manifolds. Given / and p as above wish to define a linear 
map T p f : T p M — *■ T f ^N 



Tangent map as understood via curve transfer. 

For the next version recall remark 3.1. 

Definition 3.8 (Tangent map I) If we have a smooth function between man- 
ifolds 

f : M -> N 

and we consider a point p £ M and its image q = f(p) £ N then we 
define the tangent map at p by choosing any chart (x, U) containing p 
and a chart (y, V) containing q = f(p) and then for any v £ T p M we have 
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the representative dx(v) with respect to (x,U). Then the representative of 
T p f ■ v is given by 


dj (T p f ■ v ) = D( y o / o x x ) • dx(v). 

This uniquely determines T p f ■ v and the chain rule guarantees that this 
is well defined (independent of the choice of charts). 

Definition 3.9 (Tangent map II) If we have a smooth function between man- 
ifolds 

f : M N 

and we consider a point p € M and its image q = f(p) £ N then we define the 

tangent map at p 

Tpf : T p M - T q N 

in the following way: Suppose that v £ T p M and we pick a curve c with c(0) = p 
so that v = [c], then by definition 


T p f -v= [foe] £ T q N 

where [f o c] € T q N is the vector represented by the curve f o c. 

An alternative definition for finite dimensional smooth manifolds in terms 
of derivations is the following. 

Definition 3.10 (Tangent Map III) Let M be a smooth n-manifold. View 
tangent vectors as derivation as explained above. Then continuing our set up 
above and letting g be a smooth germ at q = f(p ) £ N we define the derivation 
Tpf ■ v by 

(Tpf • v)g = v(f o g) 

It is easy to check that this defines a derivation on the (germs) of smooth func- 
tions at q and so is also a tangent vector in T q M . Thus we get a map T p f called 
the tangent map (atp). 


3.5 The Tangent and Cotangent Bundles 

3.5.1 Tangent Bundle 

We have defined the tangent bundle of a manifold as the disjoint union of the 
tangent spaces TM = \_\ peM T p M. We show in proposition 3.2 below that TM 
is itself a differentiable manifold but first we record the following two definitions. 

Definition 3.11 Give a smooth map f : M —> N as above then the tangent 
maps on the individual tangent spaces combine to give a map Tf : TM — > TN 
on the tangent bundles that is linear on each fiber called the tangent lift. 
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Definition 3.12 The map tm '■ TM — > M defined by tm{v) = p for every 
p G T p M is called the (tangent bundle) projection map. The TM together with 
the map tm '■ TM — > M is an example of a vector bundle. 


Proposition 3.2 TM is a differentiable manifold and tm '■ TM — > M is a 
smooth map. Furthermore, for a smooth map f : M — > N the tangent map is 
smooth and the following diagram commutes. 

TM ^ TN 
tm l I t n 

M -4 N 

Now for every chart (x, U) let TU = (U). The charts on TM are defined 

using charts from M are as follows 

Tx : TU — > Tx(TU) £* x(U) x R” 

Tx : £ i— > (xotm(O.v) 

where v = dx(() is the principal part of ( in the x chart. The chart Tx,TU is 
then described by the composition 


£ ^ {tm( 0,0 ( x ° r M (0> dx (0) 


but x o tm( 0 it is usually abbreviated to just x so we may write the chart in 
the handy form (x, dx). 


TU - 

-a x(U) x R' 

l 

1 

U - 

- x(f7) 


For a finite dimensional manifold and with and a chart x = (x 1 , ...,x n ), any 
vector ( G r J ^ 1 (f7) can be written 


£ 


£»*(«) 


a 

dx i 


Tm(5) 


for some u*(£) G R depending on (. So in the finite dimensional case the chart 
is just written (ar, ..., x n , v 1 , ..., v n ). 


Exercise 3.2 Test your ability to interpret the notation by checking that each 
of these statements makes sense and is true: 

1 ) Iff,=f. 1 5 §t| p and x a (p) = (a 1 ,..., a 11 ) G x Q (U a ) thenTx a {£fi = (a 1 , ..., a”, ..., ( n ) G 

U a x R”. 

2) If v = [c] for some curve with c(0) = p then 

Tx a (v) = (x a o c(0), — x Q o c) G U a x R™ 
dt t = o 
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Suppose that (x, dx) and (y, dy) are two such charts constructed as above 
from two charts U, x and V, y and that U fl V 0. Then TU fl TV 0 and on 
the overlap we have the coordinate transitions Ty o Tx” 1 :(x, v) i— > (y,w) where 


y = y°x *(x) 

n 

w=^C(y°x- 1 )| x(p) v 

k=i 

and so the overlaps will be C r ~ 1 whenever the y o x _1 are C r . Notice that for 
all p £ x(U fl V) we have 


yO)=yox 1 (x(p)) 

dy(0 = D (y° x ~ 1 )\ x{p) dx (0 


or with our alternate notation 


= I 


1 dx(£) 


dp) 


and in finite dimensions the classical notation 



dy \ 0 = 

or 


y\x\...,x n ) 



y = y o x J (x) 

dy\ 


dx 


x(p) 


This classical notation may not be logically precise but it is easy to read and 
understand. Recall our notational principle ??. 


Exercise 3.3 If M is actually equal to an open subset U of a model space M 
then it seems that we have (at least) two definitions of TU . How should this 
difference be reconciled? 


3.5.2 The Cotangent Bundle 

Each T p M has a dual space T*M. In case M is modelled on aEuclidean space 
R" we have T p M « R" and so we want to assume that T*M « R n *. 
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Definition 3.13 Let us define the cotangent bundle of a manifold M to be 
the set 

T*M := |J T*M 

p£M 

and define the map it := ttm '■ U p gM M — ■> M to be the obvious projection tak- 
ing elements in each space T*AI to the corresponding point p. Let {U, x} oe ^ be 
an atlas of admissible charts on M . Now endow T*M with the smooth structure 
given by the charts 

:T*U= 7 Tm\U) -> T*x(T*U) S x(U) x (M")* 

where the map (Tx -1 )* the contragradient ofTx. 

If M is a smooth n dimensional manifold and x 1 , x n are coordinate func- 
tions coming from some chart on M then the “differentials” dx 1 1 , dx n \ p are 

a basis of T*M basis dual to A I , I . Let a &T*U . Then we can write 

a = > aha) dx l \ , , 

for some numbers a* (a) depending on a. In fact, a.i(a) = So 

if U, x = (x, x n ) is a chart on an n-manifold M, then the natural chart 
(TU,T*x) defined above is given by 

a i— > (x 1 o 7r(a), x n o n(a), a\{a), a n (a)) 

and abbreviated to (x 1 , x", ai, a n ). 

Suppose that (x 1 , ..., x n , oi, a n ) and (x, x", di, d n ) are two such 
charts constructed in this way from two charts on U and U respectively with 
U fl 17 ^ 0. Then T*U C\T*U ^ 0 and on the overlap we have the coordinate 
transitions 


(TxT 1 )* o (Tx)* : x(U 0 17) x M"* -> x(U n U) x (IT)* 
or 

(Tx o Tx.- 1 )* : x(U D 17) x M n * -► x(U n 17) x (R n )*. 


Notation 3.3 The contragradient of D(x ox 1 ) at x £ x(U fl U) is the map 

d*x, 


dx 


(x) : (R n )* 


1 


defined by 


^(x) ' a = (D{xox ^(x))" 


When convenient we also write 


dp) 


a. 
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With this notation we can write coordinate change maps as 
(x> a) ^ |(xox' 1 )(x), ^(x) • a j . 

Write (x o x -1 )* := pryo(x o x^ 1 ) and then 

i' = (x O X -1 )*^ 1 O 7 T, X n O 7r) 

n 

hi = ^(£>(x o x _ 1 ))fa/c. 
k = 1 

and classically abbreviated even further to 


Pi 

This is the socallecl “index notation” and does not generalize well to infinite 
dimensions. The following version is index free and makes sense even in the 
infinite dimensional case: 


= &*(ar, ... ,x n ) 
dx k 


= Pk 


dx l 


x = x o X 
d*x 

a “ ax 


L (x) 


dp) 


This last expression is very nice if inaccurate and again is in line with our 
notational principle ??. 

Exercise 3.4 Show that the notion of a “cotangent lift” only works if the map 
is a diffeomorphism. 


3.6 Important Special Situations. 

If the manifold in question is an open subset U of a vector space V then the 
tangent space at any a: € V is canonically isomorphic with V itself. This was 
clear when we defined the tangent space at x as {x} x V. Then the identifying 
map is just v i— > (x, v). Now one may convince oneself that the new more 
abstract definition of T X U is essentially the same thing but we will describe the 
canonical map in another way: Let v £ V and define a curve c v : K. — *U C V 
by c v (t) = x + tv. Then T 0 c v • 1 = c v (0) € T X U. The map v i— > c v (0) is then our 
identifying map. The fact that there are these various identifications and that 
some things have several “equivalent” definitions is somewhat of a nuisance to 
the novice (occasionally to the expert also). The important thing is to think 
things through carefully, draw a few pictures, and most of all, try to think 
geometrically. One thing to notice is that for a vector spaces the derivative 
rather than the tangent map is all one needs in most cases. For example, 
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if one wants to study a map / : U C V — > W then if v p = (p,v) € T p U 
then T p f ■ v p = T p f ■ (p, v) = (p, Df | • v). In other words the tangent map is 
(p,v) i— > (p, Df | p • v) and so one might as well just think about the ordinary 
derivative D p f . In fact, in the case of a vector space some authors actually 
identify T p f with Df | p as they also identify T p U with V. There is no harm in 
this and actually streamlines the calculations a bit. 

Another related situation is the case of a manifold of matrices such as 
GL(n, R). Here GL(n,M) is actually an open subset of the set of all n x n- 
matrices M raxn (R). The latter is a vector space so all our comments above ap- 
ply so that we can think of M„ xn (R) as any of the tangent spaces T x GL(n, R). 
Another interesting fact is that many important maps such as c g : x i— > g t xg are 
actually linear so with the identifications T x GL(n,R) = M„ xn (R) and we have 

T x c g “ = ” Dc x \ g = c x : M nxn (R) — *■ M nxn (R). 

Definition 3.14 (Partial Tangential) Suppose that f : M-| x M 2 — > IV is a 

smooth map. We can define the partial maps as before and thus define partial 
tangent maps: 


(■ dif ) (x,y) : T X M\ — > T f ( x . y) N 
(^2 f) (x,y) ■ T y M 2 -»■ T f{XtV) N 

Next we introduce a natural identification . It is obvious that a curve c : 
I —> Mi x M 2 is equivalent to a pair of curves 

Ci : I — > Mi 

c 2 : I — > M 2 

The infinitesimal version of this fact gives rise to a natural identification 


T( x ,y)(M i x M 2 ) = T x Mi x T y M 2 


This is perhaps easiest to see if we view tangent vectors as equivalence classes 
of curves (tangency classes). Then if we choose c = (ci, c 2 ) so that c(0) = (a;, y) 
then we identify ^ = [c] G T( XiV )(Mi x M 2 ) with ([ci],[c 2 ]) € T x Mi x T y M 2 . 
For another view, consider the insertion maps t x : y i— > ( x , y) and l v : x i— > 
( x,y ). We have linear monomorphisms Tl v (x) : T X M\ — >■ T^ xy ){Mi x M 2 ) 
and Ti x {y) : T y M 2 — * T^ x y ^{Mi x M 2 ). Let us denote the images of T x Mi 
and T y M 2 in T^ xy ){AIi x M 2 ) under these two maps by the same symbols 
(T X M) i and ( T y M) 2 . We then have the internal direct sum ( T x M)i © ( T y M) 2 
= T( Xi j / )(MixM 2 ) and the map Tl v x Tl x : T x M 1 xT y M 2j — >■ (! T x M)\®(T y M) 2 C 
T( Xty )(Mi x M 2 ). The inverse of this map is T^ xy )pr i x T^ xy ^pr 2 which is 
then also taken as an identification. One way to see the naturalness of this 
identification is to see the tangent functor as taking the commutative diagram 
in the category of pairs 
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Mi x M 2 , (z,y) 


pr 1 
/ 


pr 2 

\ 


id I 

M 2 ,y 

Ly \ 


1* X 


Mi x M 2 , (a;,y) 


to the new commutative diagram 


Tpr 1 

T(x,y)(M 1 x M 2 ) 

Tpr 2 

/ 

I 

\ 

T X M X 

T X M\ x T y M 2 

TyM 2 

T X ly \ 

I 

/ t x l x 


(T x M)i © (T y M) 2 



Notice that we have / o l v = f tV and f o l x = f x . Looking again at the 
definition of partial tangential one arrives at 

Lemma 3.4 (partials lemma) For a map f : M\ x M 2 — » N we have 

T (x, y )f ' (v, w) = (di /) (a;, y) ■ v + (<9 2 /) (a:, y) • w. 

where we have used the aforementioned identification Tt x y ) (Mi x M 2 ) = TL y ) (Mi x 
M 2 ). 

Proving this last lemma is much easier and more instructive than reading 
the proof. Besides, it easy. 

The following diagram commutes: 

T( x ,y){M 1 x M 2 ) 

\ 

r ^'{x,y)P^ 1 X -^(x,y)P?’2 t Tf(x,y)N 

/ 

T X M\ x T y M 2 

Essentially, both diagonal maps refer to T( x , y )f because of our identification. 
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Chapter 4 

Submanifold, Immersion 
and Submersion. 

4.1 Submanifolds 

Recall the simple situation from calculus where we have a continuously differ- 
entiable function F{x, y) on the x,y plane. We know from the implicit func- 
tion theorem that if ^(xo,yo) 7^ 0 then near (xo,yo) the level set F(x,y) = 
F{xo, yo) is the graph of some function y = g(x). The map (x,y) e- > x is a 
homeomorphism onto an open subset of ffi. and provides a chart. Hence, near 
this point, the level set is a 1-dimensional differentiable manifold. Now if either 
x, y) yf 0 or (x, y) ^ 0 at every (x, y) on the level set, then we could cover 
the level set by these kind of charts (induced by coordinate projection) and so 
we would have a smooth 1-manifold. This idea generalizes nicely not only to 
higher dimensions but to manifolds in general and all we need is local theory of 
maps as described by the inverse and implicit mapping theorems. 

There is another description of a level set. Locally these are graphs of 
functions. But then we can also parameterize portions of the level sets by using 
this local graph structure. For example, in the simple situation just described 
we have the map t 1— > (t + xq , g(t + yo )) which parameterizes a portion of the 
level set near Xo,yo- The inverse of this parameterization is just the chart. 

First we define the notion of a submanifold and study some related gen- 
eralities concerning maps. We then see how this dual idea of level sets and 
parameterizations generalizes to manifolds. The reader should keep in mind 
this dual notion of level sets and parameterizations. 

A subset S of a C r -differentiable manifold M (modelled on K") is called a 
(regular ) submanifold (of M) if there exists a decomposition of the model 
space R n = R" _fe x such that every point p £ S is in the domain of an 
admissible chart (x, U) which has the following submanifold property: 

x(u ns) = x(U) n (R n ~ k x {o)} 
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X ) 


(x t x r ,o 


., 0 ) 


Equivalently, we require that x : U — > V\ x Vi C K" k x R fc is a diffeomorphism 
such that 

x(C7 n S) = Vi x {0} 

for open Vi, V 2 . We will call such charts adapted to S. The restrictions x| r/nS 
of adapted charts provide an atlas for S (called an induced submanifold atlas ) 
making it a differentiable manifold in its own right. The k above is called the 

codimension of S (in M). 

Exercise 4.1 Show that S really is differentiable manifold and that a continu- 
ous map f : N — > M which has its image contained in S is differentiable with 
respect to the submanifold atlas iff it is differentiable as a map in to AI. 

When S' is a submanifold of M then the tangent space T p S at p £ S C M 
is intuitively a subspace of T p M. In fact, this is true as long as one is not 
bent on distinguishing a curve in S through p from the “same” curve thought 
of as a map into M. If one wants to be pedantic then we have the inclusion 
map 1 : S M and if c : I — > S curve into S then l o c : I — > M is a 
map into M as such. At the tangent level this means that c'(0) £ T p S while 
(r o c) , (0) £ T p M. Thus from this more pedantic point of view we have to 
explicitly declare T p i : T p S — * T p i(T p S) C T p AI to be an identifying map. We 
will avoid the use of inclusion maps when possible and simply write T p S C T p M 
and trust the intuitive notion that T p S is indeed a subspace of T p M. 
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4.2 Submanifolds of 

If M C R" is a regular fc-dimensional submanifold then, by definition, for every 
p £ M there is an open subset U of R" containing p on which we have new 
coordinates (p : U — > V C R” abbreviated by (y 1} .... ,y n ) such that .M DC/ is 
exactly given by yu+i = •••• = y n = 0. On the other hand we have the identity 
coordinates restricted to U which we denote by X\, ...,x n . We see that id o <p ~ 1 
is a diffeomorphism given by 

£i = £i(2/i, — -,2/n) 

£2 = £2(2/1, 

£n — £n (// 1 1 — , Vn) 

which in turn implies that the determinant det ~ must be nonzero 
throughout V. From a little linear algebra we conclude that for some renum- 
bering of the coordinates the Xi,....,x n the determinant det must 

be nonzero at and therefore near <f>(p) £ V C R n . On this possibly smaller 
neighborhood V' we define a map F by 

= £ 1 ( 2/1 > ....,y n ) 

X2 = £2(2/1, — 5 Vn) 

x r = x r ( 1 / 1 , — , 2/n) 

3V+1 = 2/r+l 


— Un 

then we have that F is a local diffeomorphism V' C R n — > R n . Now let 4>~ 1 U' = 
V' and form the composition ip := F o (f> which is defined on U' and must have 
the form 

Zi = Xi 

Z2 = £2 


Z-p — X'p 

Zr- (-1 — VV+l(£l, , £n) 

Z n = ^n(£l, • — ,£n) 

From here is it is not hard to show that z r + 1 = • • • = z n = 0 is exactly the set 
ip(Mr\U') and since <p restricted to a M C\U’ is a coordinate system so ip restricted 
to M fl U‘ is a coordinate system for M. Now notice that in fact ip maps a point 
with standard (identity) coordinates («i, a n ) onto («i, a r , 0, 0). Now 
remembering that we renumbered coordinates in the middle of the discussion 
we have proved the following theorem. 
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Figure 4.1: Projection onto some plane gives a chart. 


Theorem 4.1 If M is an r -dimensional regular submanifold of K" then for 
every p € M there exists at least one r-dimensional coordinate plane P such 
that linear projection P — > R" restricts to a coordinate system for M defined in 
a neighborhood of p. 


4.3 Regular and Critical Points and Values 

Proposition 4.1 If f : M — » N is a smooth map such that T p f : T p M —> T q N 
is an isomorphism for all p £ M then f : M — > N is a local diffeomorphism. 

Definition 4.1 Let f : M — > N be C r -map and p £ M we say that p is a 
regular point for the map f if T p f is a splitting surjection (see 26. 44) and 
is called a singular point otherwise. For finite dimensional manifolds this 
amounts to the requirement that T p f have full rank. A point q in N is called 
a regular value of f if every point in the inverse image f~ 1 {q} is a regular 
point for f. A point of N which is not regular is called a critical value. The 
set of regular values is denoted 7 Zf. 

It is a very useful fact that regular values are easy to come by in that most 
values are regular. In order to make this precise we will introduce the notion 
of measure zero on a manifold. It is actually no problem to define a Lebesgue 
measure on a manifold but for now the notion of measure zero is all we need. 
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Figure 4.2: Four critical points of the height function. 


Definition 4.2 A set A in a smooth finite dimensional manifold M is said to 
be of measure zero if for every admissible chart U, the set f>{A f~l U) has 
Lebesgue measure zero in R" where dimM = n. 

In order for this to be a reasonable definition the manifold must be second 
countable so that every atlas has a countable subatlas. This way we may be 
assured that every set which we have defined to be measure zero is the countable 
union of sets which are measure zero as view in a some chart. We also need to 
know that the local notion of measure zero is independent of the chart. This 
follows from 

Lemma 4.1 Let M be a n-manifold. The image of a measure zero set under a 
differentiable map is of measure zero. 

Proof. We are assuming, of course that M is Hausdorff and second count- 
able. Thus any set is contained in the countable union of coordinate charts 
we may assume that / : U C R n — > R n and A is some measure zero sub- 
set of U. In fact, since A is certainly contained in the countable union of 
compact balls ( all of which are translates of a ball at the origin) we may as 
well assume that U = 73(0, r) and that A is contained in a slightly smaller 
ball 73(0, r — 5) C 73(0, r). By the mean value theorem, there is a constant 
c depending only on / and its domain such that for x,y £ 73(0, r) we have 
| f(y) — f(x) | < c\x — y\. Let e > 0 be given. Since A has measure zero there is 
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a sequence of balls B(xi, e,) such that A C (J B(xi, ef) and 

Thus f(B(xi,ei)) C B(f(xi), 2ce,) and while /(A) C (J 2ce,) we also 

have 

voi (|jB(/(*i), 2 cei)) < 
y^vol(-B(/(^),2cet)) < vol(-Bi) (2cej) n 

< 2 n c n ^2 vol (B(xi, e*)) 

< e. 

Thus the measure of T is less that or equal to e. Since e was arbitrary it follows 
that A has measure zero. ■ 

Corollary 4.1 Given a fixed A = {U a ,x a } atlas for M, if x a {A fl U a ) has 
measure zero for all a then A has measure zero. 

Theorem 4.2 (Sard) Let M be an n-manifold and N an m-manifold ( Haus - 
dorjf and second countable) . For a smooth map f : M — > N the set of regular 
values IZf has Lebesgue measure zero. 

Proof. Through the use of a countable cover of the manifolds in question 
by charts we may immediately reduce to the problem of showing that for a 
smooth map / : U C R" — » R m the set of critical values C C U has image /(C) 
of measure zero. We will use induction on the dimension n. For n = 0, the 
set /(C) is just a point (or empty) and so has measure zero. Now assume the 
theorem is true for all dimensions j < n — 1. We seek to show that the truth of 
the theorem follows for j = n also. 

Using multiindex notation (26.3) let 

Ci := {x £ U : — — —(x) = 0 for all led < *}. 

1 dx a ~ 


Then 

C = (C\Ci) U (Ci\C 2 ) U • • • U (C fc _AC fc ) u c k 

so we will be done if we can show that 

a) /(C\Ci) has measure zero, 

b) f(Cj_i\Cj ) has measure zero and 

c) f(C k ) has measure zero for some sufficiently large k. 

Proof of a): We may assume that m > 2 since if rn = 1 we have C = C\. 
Now let x £ C\C\ so that some first partial derivative is not zero at x = a. By 
reordering we may assume that this partial is Jp- and so the 

{x 1 ,...,x n ) i ^ (f(x),x 2 ,...,x n ) 
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map restricts to a diffeomorphism cf> on some open neighborhood containing x. 
Since we may always replace / by the equivalent map f ocj)^ 1 we may go ahead 
and assume without loss of generality that / has the form 

f -x (x 1 , f 2 (x), ..., f m (x)) := (x 1 ,h(x)) 


on some perhaps smaller neighborhood V containing a. The Jacobian matrix 
for / in V is of the form 

'1 O' 

* Dh 

and so x £ V is critical for / if and only if it is critical for h. Now h(C fl V) C 
R m_1 and so by the induction hypothesis h(C fl V) has measure zero in R m_1 . 
Now f(C nF)fl ({a;} x R m_1 ) C {x}xh(C fl V) which has measure zero in 
{: x } x R" 1 ” 1 = R m_1 and so by Fubini’s theorem f(C fl V) has measure zero. 
Since we may cover C by a countable number of sets of the form C D V we 
conclude that f(C) itself has measure zero. 

Proof of (b): The proof of this part is quite similar to the proof of (a). Let 
a £ Cj-{\Cj. It follows that some fc-th partial derivative is 0 and after some 
permutation of the coordinate functions we may assume that 


d 0101 f 1 
dx 1 dxP 


(a) ± 0 


O | f3 | £ 1 

for some j — 1- tuple (3 = (ii, ...,ij- 1 ) where the function g := is zero at 

a since a is in Ck- 1 - Thus as before we have a map 


x i ^ {g( x),x 2 ,...,x n ) 


which restricts to a diffeomorphism (j) on some open set V. We use <j>,V as a 
chart about a. Notice that <j>{Cj- 1 fl V) C 0 x R™ -1 . We may use this chart <j) 
to replace / by g = f o 0 -1 which has the form 

x i— > (x 1 , h(x)) 

for some map h : V — > R m_1 . Now by the induction hypothesis the restriction 
of g to 

g 0 ■■ {0} x JR"" 1 nh^R m 

has a set of critical values of measure zero. But each point from 0(Cj_i fl 
V) C 0 x R n_1 is critical for go since diffeomorphisms preserve criticality. Thus 
g o (f>(Cj- 1 fl V) = f(Cj- 1 fl V) has measure zero. 

Proof of (c): Let I n (r) C U be a cube of side r. We will show that if 
k > (n/m) — 1 then /(/™(r) fl Ck) has measure zero. Since we may cover by 
a countable collection of such V the result follows. Now Taylor’s theorem give 
that if a £ I n {r) fl Ck and a + h £ I n (r) then 


fc+i 


I f(a + h) - /(a) | < c\h\ 


(4.1) 
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for some constant c which depends only on / and /"(r). We now decompose 
the cube J"(r) into R n cubes of side length r/R. Suppose that we label these 

cubes which contain critical points of / as D\, Dm- Let D* contains a critical 

point a of /. Now if y € D then \y — a| < i /nr/R so using the Taylor’s theorem 
remainder estimate above (4.1) with y = a + h we see that f(Di ) is contained 
in a cube D.; C R m of side 


2c 



b 


where the constant b (^/nr) k+1 is independent of the particular cube D from 
the decomposition and depends only on / and J"(r). The sum of the volumes 
of all such cubes Di is 


S<R n 



m 


which, under the condition that m(fc+ 1) > n, may be made arbitrarily small be 
choosing R large (refining the decomposition of /"(r)). The result now follows. 


Corollary 4.2 If M and N are finite dimensional manifolds then the critical 
values of a smooth map f : M N are dense in N. 

4.4 Immersions 

Definition 4.3 A map f : M —> N is called an immersion at p £ M iff 
T p f : T p M — > Ty( p )7V is a linear injection (see 26-43) at p. A map f : M — > N 
is called an immersion if f is an immersion at every p € M. 

Figure 4.3 shows a simple illustration of an immersion of R 2 into R 3 . This 
example is also an injective immersion (as far as is shown) but an immersion 
can come back and cross itself. Being an immersion at p only requires that the 
restriction of the map to some small open neighborhood of p is injective. If an 
immersion is (globally) injective then we call it an immersed submanifold (see 
the definition 4.4 below). 

Theorem 4.3 Let f : M n — > N d be a smooth function which is an immersion 
at p. Then f : M n — > N d there exists charts x :: ( M n ,p ) — » ( R",0) and 
y :: ( N d ,f(p )) — > ( R d ,0) such that 

y o / o x _1 :: R” — > R ra x R d ~ n 

is given by x. i— > (x, 0) near 0. In other words, there is a open set U C M 
such that f(U) is a submanifold of N the expression for f is (x 1 ,...,x n ) i— > 
(x 1 , ..., x n , 0, ..., 0) e R d . 

Proof. Follows easily from theorem 26.12. ■ 
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Figure 4.3: Embedding of the plane into 3d space. 


Theorem 4.4 If f : M — > N is an immersion (so an immersion at every point) 
and if f is a homeomorphism onto its image f(M) using the relative topology, 
then f(M) is a regular submanifold of N. In this case we call f : M — > N an 

embedding. 

Proof. Follows from the last theorem plus a little point set topology. ■ 


4.5 Immersed Submanifolds and Initial Subman- 
ifolds 

Definition 4.4 If I : S — > M is an injective immersion then ( S , I) is called an 

immersed submanifold. 

Exercise 4.2 Show that every injective immersion of a compact manifold is an 
embedding. 

Theorem 4.5 Suppose that M is an n— dimensional smooth manifold which 
has a finite atlas. Then there exist an injective immersion of M into R 2n+1 . 
Consequently, every compact n— dimensional smooth manifold can be embedded 
into R 2n+1 . 

Proof. Let M be a smooth manifold. Initially, we will settle for an im- 
mersion into K 15 for some possibly very large dimension D. Let {Oi, <Pi}i£N be 
an atlas with cardinality N < oo. The cover {Oi} cover may be refined to two 
other covers {Ui}i e N and {P} ie jv such that Ui C Vi C Vi C Oi . Also, we may 
find smooth functions /; : M — > [0, 1] such that 

fi(x) = 1 for all x e Ui 
supp (fi) C Oi. 
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Next we write ipi = (xj,....x™) so that x\ : 0,; — > R is the j— th coordinate 
function of the i-th chart and then let 

fij ■= fiX 3 i (no sum) 

which is defined and smooth on all of M after extension by zero. 

Now we put the functions /j together with the functions f l:] to get a map 
i:M -> R n + Nn : 


i — (/ 1) fn, /ll> /l2j f'21, , /njv) • 

Now we show that i is injective. Suppose that i{x) = i(y). Now fk{x) must 
be 1 for some k since x £ Uk for some k. But then also fk{y) = 1 also and this 
means that y £ 14 (why?). Now then, since fk{x) = fk(y) = 1 it follows that 
fkj(x) = fkj{y) for all j. Remembering how things were defined we see that x 
and y have the same image under tpk : Ok — > R™ and thus x = y. 

To show that T x i is injective for all x £ M we fix an arbitrary such x and 
then x £ Uk for some k. But then near this x the functions fk\J'k/ 2 , ■■■, fkn, are 
equal to x\, .... x j? and so the rank of i must be at least n and in fact equal to n 
since dim T X M = n. 

So far we have an injective immersion of M into R ra+iVl \ 

We show that there is a projection tt : R D — > L C R 15 where L = R 2n+1 is a 
2n+l dimensional subspace of R 15 , such that nof is an injective immersion. The 
proof of this will be inductive. So suppose that there is an injective immersion / 
of M into R d for some d with D > d > 2n+l. We show that there is a projection 
nd ■ R d — >■ T d_1 = R d_1 such that "Kd 0 f is still an injective immersion. To this 
end, define a map h : M x M x R — > R d by h( x, y, t) := t(f(x) — f(y)). Now since 
d > 2n+l, Sard’s theorem implies that there is a vector y £ R d which is neither 
in the image of the map h nor in the image of the map df : T M — > R d . This y 
cannot be 0 since 0 is certainly in the image of both of these maps. Now if pr± y 
is projection onto the orthogonal compliment of y then pr± y o / is injective; for 
if pr± y o f(x) = pr± y o f(y) then f(x) — f(y) = ay for some a £ R. But suppose 
x yf y. then since / is injective we must have a / 0. This state of affairs is 
impossible since it results in the equation h(x, y, 1/a) = y which contradicts our 
choice of y. Thus pr± y o / is injective. 

Next we examine T x (pr± y o /) for an arbitrary x £ M. Suppose that 
T x (pr± y o f)v = 0. Then d(pr± y o f)\ v = 0 and since pr± y is linear this 
amounts to pr± y o df\ x v = 0 which gives df \ x v = ay for some number a £ R 
which cannot be 0 since / is assumed an immersion. But then df \ x -v = y 
which also contradict our choice of y. 

We conclude that pr± y o / is an injective immersion. Repeating this process 
inductively we finally get a composition of projections pr : R D — > R 2ra+1 such 
that pr o f : M —> R 2 ” +1 is an injective immersion. ■ 

It might surprise the reader that an immersed submanifold does not neces- 
sarily have the following property: 
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Figure 4.4: Counter example: consider the superposition. 


Criterion 4.1 Let S and M be a smooth manifolds. An injective immersion 
I : S — > M is called smoothly universal if for any smooth manifold N , a 
mapping f : N — > S is smooth if and only if I o / is smooth. 

To see what goes wrong, imagine the map corresponding to superimposing 
one of the figure eights shown in figure 4.4 onto the other. If / : S — > M 
is an embedding then it is also smoothly universal but this is too strong of a 
condition for our needs. For that we make the following definitions which will 
be especially handy when we study foliations. 

Definition 4.5 Let S be any subset of a smooth manifold M. For any x £ S 
denote by C X (S) the set of all points of S that can be connected to x by a smooth 
curve with image entirely inside S. 

Definition 4.6 A subset S C M is called an initial submanifold if for 
each sq £ S there exists a chart U,x centered at sq such that x(C So (U D S)) = 
x(U) n (R d x {0}) for some splitting ]R” = x R n ~ d (which is independent of 

s o )■ 

The definition implies that if S is an initial submanifold of M then it has a 
unique smooth structure as a manifold modelled on and it is also not hard 
to see that any initial submanifold S has the property that the inclusion map 
S ^ M is smoothly universal. Conversely, we have the following 

Theorem 4.6 If an injective immersion I : S — > M is smoothly universal then 
the image f(S) is an initial submanifold. 

Proof. Choose So € S. Since I is an immersion we may pick a coordinate 
chart w : W — > K d centered at So and a chart v : V — ■> R" = R d x R n-d centered 
at I{so) such that we have 

vo I o w _1 (y) = (y, 0). 
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Choose an r > 0 small enough that B(0, r) C w(U) and B(0,2r ) C w(V). 
Let Uo = v~ 1 (B(0,r)) and Wo = w _1 (C). We show that the coordinate chart 
Vo, u := <t\v 0 satisfies the property of lemma 4.6. 

u-\u(U 0 ) n (R d x {0})) = «- 1 {(y, 0) : ||y|j < r} 

= I o ur 1 o (u o I o w _1 ) _1 ({(y, 0) : ||y|| < r}) 

= louf'Oy : ||y|| < r}) = I(W 0 ) 

Now I (Wo) C Uo fl I(S) and since I (Wo) is contractible we have I (Wo) C 
C f{so) (Uo n I(S)). Thus u-\u(Uo) n ( x {0})) c c fM (Uo n I(S)) or 

u(Uo) n (R d X {o}) c u(c IM (u 0 ni(S))). 

Conversely, let 2 £ C/( So )(Uo ft I(S)). By definition there must be a smooth 
curve c : [0, 1] — > S starting at /(s 0 ), ending at z and c([0, 1]) C U 0 Cl(S). Since 
I : S — > M is smoothly universal there is a unique smooth curve C\ : [0, 1] — » S 
with I o C\ = c. 

Claim 4.1 ci([0, 1]) C Wo- 

Assume not. Then there is some number t £ [0,1] with Ci(t) £ u> -1 ({r < 
||y||<2 r}). The 

(v o I)(ci(t)) £ (v o I o tu _1 )({r < ||y|| < 2r}) 

= {(y, 0) : r < ||y|| < 2r} C {z £ R n : r < ||y|| < 2r}. 

Now this implies that (voIoa)(t) = (vo c)(t) £ {z £ R n : r < ||y|| < 2r} which 
in turn implies the contradiction c(t) ^ Uq. The claim is proven. 

Now the fact that ci ( [0, 1] ) C Wo implies ci(l) = / _1 (z) £ Wo and so 
z £ I (Wo). As a result we have Cj( So )(Uo fl I(S)) = I (Wo) which together with 
the first half of the proof gives the result: 

/(Wo) = u-'Kf/o) n (R d X {o})) c c f(so) (Uo n i(S)) = i(w 0 ) 

=► w 1 (w(c/ 0 ) n (R d X {o})) = c fM (Uo n i(S)) 

=► u(Uo) n (R d X {o}) = u(c f{so) (Uo n i(S))). 

■ 

We say that two immersed submanifolds (Si, Ii) and (S 2 , If) are equivalent 
if there exist a diffeomorphism 4> : — > S 2 such that I 2 0 4> = I\ ; i.e. so that 

the following diagram commutes 

$ 

5! - S 2 

\ / 

M 
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Now if I : S — > M is smoothly universal so that f(S) is an initial submanifold 
then it is not hard to see that (S,I) is equivalent to ( ) where i is the 
inclusion map and we give f(S) the unique smooth structure guaranteed by the 
fact that it is an initial submanifold. Thus we may as well be studying initial 
submanifolds rather than injective immersions that are smoothly universal. For 
this reason we will seldom have occasion to even use the terminology “smoothly 
universal” which the author now confesses to be a nonstandard terminology 
anyway. 


4.6 Submersions 

Definition 4.7 A map f : M — > N is called a submersion at p £ M iff 
T p f : T p M — > Tf( p )N is a (bounded) splitting surjection (see 26-43). f : M — > N 
is called a submersion if f is a submersion at every p € M. 

Example 4.1 The map of the punctured space R 3 — {0} onto the sphere S 2 
given by x i— > \x\ is a submersion. To see this use spherical coordinates and 
the map becomes ( p,4>,0 ) i— > (<j>,9). Here we ended up with a projection onto a 
second factor R x R 2 — > R 2 but this is clearly good enough. 

Theorem 4.7 Let f : M — > N be a smooth function which is an submersion 
at p. Then there exists charts x :: (. M,p ) — » (R" -fe x R fc ,0) = ( R n ,0) and 
y :: ( N,f(p )) — > (R fe ,0) such that 

y o / o x" 1 :: (R n ~ k x R fc , 0) -► (R fe , 0) 

is given by (x, y) i— > x near 0 = (0, 0). 

Proof. Follows directly from theorem 26.14. ■ 

Corollary 4.3 (Submanifold Theorem I) Consider any smooth map f : M - 

N then if q £ N is a regular value the inverse image set f^ 1 (q) is a regidar sub- 
manifold. 

Proof. If q £ N is a regular value then / is a submersion at every p £ / _1 (q) . 
Thus for any p £ f~ 1 {q) there exist charts ip :: ( M,p ) — > ( R" -fc x R fe ,0) = ( 
R™, 0) and (f> :: (N,f(j>)) — » ( R ra_fc ,0) such that 

(p o f o if- 1 :: (R n ~ k x R fc , 0) (R”" fc , 0) 

is given by (x, y) i— > x near 0. We may assume that domains are of the nice form 

[/'xf'Cf/iyi; v'. 

But (j> o f o %f>~ 1 is just projection and since q corresponds to 0 under the diffeo- 
morphism we see that ip(U Cf~ 1 (q)) = (</>o /o^ _1 ) _1 (0) = prf 1 ( 0) = U' x {0} 
so that ,f~ 1 (q) has the submanifold property at p. Now p £ f~ 1 (q) was arbitrary 
so we have a cover of f~ 1 (q) by submanifold charts. ■ 
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Example 4.2 (The unit sphere) The set 5 11-1 = {a: € : x ■ x = 1} is a 

codimension 1 submanifold o/M” since we can use the map ( x,y ) i— > x 2 + y 2 as 
our map and let q = 1. 

Given k functions Fl(x,y) on R" x R fc we define the locus 

M := {(; x,y ) £ I" x : F\x,y ) = d>} 

where each c? is a fixed number in the range of F° . If the Jacobian determinant 
at (x 0 ,y 0 ) e M; 

dF 

det -K~{.x 0 ,y 0 ) 
dy 

is not zero then near (xq, yo) then we can apply the theorem. We can see things 
more directly: Since the Jacobian determinant is nonzero, we can solve the 
equations F^{x,y) = cP for y 1 , ...,y n in terms of x 1 , ...,x k : 

y 1 = / 1 ( a;1 ) x k ) 
y 2 = f 2 (x 1 ,...,x k ) 

y n = f n {x\...,x k ) 

and this parameterizes M near (xo,yo) in such a nice way that the inverse is a 
chart for M. This latter statement is really the content of the inverse mapping 
theorem in this case. If the Jacobian determinant never vanishes on M then we 
have a cover by charts and M is a submanifold of R n x R fc . 

It may help the understanding to recall that if Fi{x,y) and (xo,j/o) are as 
above then we can differentiate the expressions F : ’(x,y(x)) = C to get 

dF 1 v- OF 1 dy s 

dx 3 ^ 3y s dx : > 


and then solve 


dy j 

dx i 




,■ dF s 
s dx i 


where [J 1 ]^ is the matrix inverse of the Jacobian ^ evaluated at points near 

(xo,yo)- 


Example 4.3 The set of all square matrices M nxn is a manifold by virtue of 
the obvious isomorphism M nxn = R n . The set st)tn(n, M) of all symmetric 
matrices is an n(n + 1) / 2 - dimensional manifold by virtue of the obvious 1-1 
correspondence stim(n,R) = R n (" +1 i/ 2 given by using n(n + l)/2 independent 
entries in the upper triangle of the matrix as coordinates. 

Now the set 0(n,M.) of all nxn orthogonal matrices is a submanifold of M nxn . 
We can show this using Theorem f.3 as follows. Consider the map f : M nxn — ► 
st)tn(n,K) given by A i— > T 4 A. Notice that by definition of 0(n,M.) we have 
/ _1 (/) = 0(n,K). Let us compute the tangent map at any point Q G / _1 (J) = 
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0(n,R). The tangent space o/stjm(n, K) at I isstim(n,R) itself since R) 

is a vector space. Similarly, M nxn is its own tangent space. Under the identi- 
fications of section 3. 6 we have 

T Q f ■ v = ^-{Q l + sv t ){AQ + si>) = i fQ + Q*v. 
ds 

Now this map is clearly surjective onto stjm(rc,R) when Q = I. On the other 
hand, for any Q £ 0(n, R) consider the map Lq-i : M nxn — * M nxn given by 
Lq-i(B) = Q~ X B. The map TqLq- i is actually just TqLq- i • v = Q~ 1 v which 
is a linear isomorphism since Q is a nonsingular. We have that f o Lq = / 
and so by the chain rule 


T Q f ■ v = T// o T q (L q -i) ■ v 
= T r f ■ Q~ 1 v 

which shows that Tq f is also surjective. 

The following proposition shows an example of the simultaneous use of Sard’s 
theorem and theorem4.3. 

Proposition 4.2 Let M be a connected submanifold o/R™ and let S be a linear 
subspace o/R n . Then there exist a vector v € R" such that (v + S) D M is a 
submanifold of M . 

Proof. Start with a line l through the origin that is normal to S. Let 
pr : R" — ■> S be orthogonal projection onto l . The restriction tt := pr\ M — » l 
is easily seen to be smooth. If n(M) were just a single point x then 7r _1 (a;) 
would be all of M. Now 7 r(M) is connected and a connected subset of l = R 
must contain an interval which means that n(M) has positive measure. Thus 
by Sard’s theorem there must be a point v £ l which is a regular value of n. But 
then 4.3 implies that 7 r — 1 (z;) is a submanifold of M. But this is the conclusion 
since 7r — 1 (v) = (v + S) fl M. m 

4.7 Morse Functions 

If we consider a smooth function / : M — > R and assume that M is a compact 
manifold (without boundary) then / must achieve both a maximum at one or 
more points of M and a minimum at one or more points of M. Let x e be one of 
these points. The usual argument shows that df\ = 0 (Recall that under the 
usual identification of R with any of its tangent spaces we have df\ e = T e f ). 
Now let Xo be some point for which df | =0. Does / achieve either a maximum 

or a minimum at Xq7 How does the function behave in a neighborhood of Xq ? 
As the reader may well be aware, these questions are easier to answer in case the 
second derivative of / at Xq is nondegenerate. But what is the second derivative 
in this case? One could use a Riemannian metric and the corresponding Levi- 
Civita connection to be introduced later to given an invariant notion but for us 
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it will suffice to work in a local coordinate system and restrict our attention to 
critical points. Under these conditions the following definition of nondegeneracy 
is well defined independent of the choice of coordinates: 


Definition 4.8 The Hessian matrix of f at one of its critical points Xq and 
with respect to coordinates if = (ar, ... ,x n ) is the matrix of second partials: 


H = 


dx 1 dx 1 


Oo) 


d 2 foil >- 1 

dx 1 dx n 


Oo) 


d 2 foj >~ 1 

dx n dx 1 


Oo) 


d 2 foj >~ 1 

dx n dx n 


(x 0 ) 


The critical point is called nondegenerate if H is nonsingular 


Now any such matrix H is symmetric and by Sylvester’s law of inertia this 
matrix is equivalent to a diagonal matrix whose diagonal entries are either 1 or 
— 1. The number of —1 occurring is called the index of the critical point. 


Exercise 4.3 Show that the nondegeneracy is well defined. 

Exercise 4.4 Show that nondegenerate critical points are isolated. Show by 
example that this need not be true for general critical points. 

The structure of a function near one of its nondegenerate critical points is 
given by the following famous theorem of M. Morse: 

Theorem 4.8 (Morse Lemma) If f : M — > R is a smooth function and Xo is 
a nondegenerate critical point for f of index i. Then there is a local coordinate 
system U,x containing Xq such that the local representative fu := / o x -1 for 
has the form 

fuix 1 , ••., x n ) = f(x 0 ) + ^2 hijX l x 3 

and where it may be arranged that the matrix h = ( hij ) is a diagonal matrix of 
the form diag(— 1, ... — 1,1,. ..,1) for some number (perhaps zero) of ones and 
minus ones. The number of minus ones is exactly the index i. 

Proof. This is clearly a local problem and so it suffices to assume / :: 
R™ — > R and also that /( 0) = 0. Then our task is to show that there exists a 
diffeomorphism <j> : R" — > R™ such that / o <j>(x) = x 3 hx for a matrix of the form 
described. The first step is to observe that if g : U C R" — > R is any function 
defined on a convex open set U and c/(0) = 0 then 
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Thus g is of the form g = Y^=i u i9i for certain smooth functions <?i, 1 < i < n 
with the property that dig( 0) = gi(0). Now we apply this procedure first to / 
to get / = J2i=i u ifi where 9^/(0) = /»( 0) = 0 and then apply the procedure 
to each f, and substitute back. The result is that 

n 

f(ui, ...,u n ) = u iUj h^( Ul ,...,u n ) (4.2) 

i,j = 1 

for some functions h lJ with the property that h lJ () is nonsingular at and there- 
fore near 0. Next we symmetrize (hP) by replacing it with \{h^ + hf l ) if 
necessary. This leaves the expression 4.2 untouched. Now the index of the 
matrix (h‘ J (0)) is i and this remains true in a neighborhood of 0. The trick 
is to find, for each x in the neighborhood a matrix C(x) which effects the di- 
agonalization guaranteed by Sylvester’s theorem: D = C(x)h(x)C(x)~ 1 . The 
remaining details, including the fact that the matrix C(x) may be chosen to 
depend smoothly on x, is left to the reader. ■ 

We can generalize theorem 4.3 using the concept of transversality . 

Definition 4.9 Let f : M — > TV be a smooth map and S C N a submanifold 
of N. We say that f is transverse to S if for every p € f~ 1 (S) the image of 
T p M under the tangent map T p f and Tf^S together span all ofTj^N: 

T f{p) N = T m S + T p f(T p M). 

If f is transverse to S we write f rh S. 

Theorem 4.9 Let f : M — » N be a smooth map and S C N a subman- 
ifold of N and suppose that f rh S. Then f~ 1 (S) is a submanifold of M. 
Furthermore we have T p (f~ 1 (S)) = Tf ( p ) / _1 (Tf ( P )S) for all p € f^ 1 (S) and 
codim(f^ 1 (S) )=codim(S ). 

4.8 Problem set 

1. Show that a submersion always maps open set to open set (it is an open 
mapping). Further show that if M is compact and N connected then a 
submersion / : M — > N is a submersion must be surjective. 

2. Show that the set of all symmetric matrices Sym nxn (^) is a submani- 
fold of M nxn (K). Under the canonical identification of TsM nxn (M.) with 
M„ xn (K) the tangent space of Sym nxn ( R) at the symmetric matrix S 
becomes what subspace of M nxra (K)? Hint: It’s kind of a trick question. 

3. Prove that in each case below the subset f~ 1 (po) is a submanifold of M: 

a) / : M = M nxn (M.) -> Sym nxn { R) and f{Q) = Q ( Q with p 0 = I the 
identity matrix. 

b) / : M = K” — > R 2 and f{x\, ..., x n ) = (x\ 2 + ■ ■ ■ + x n 2 , — x± 2 + ■ ■ ■ + 
x n 2 ) with p 0 = (0,0). 
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4. Show that the set of all m x n matrices is a submanifold of M mxn (R). 

5. Show that if p(x) = p(x\, x n ) is a homogeneous polynomial so that for 
some to £ Z + 

p(tx i, fz„) = t m p(x 1: x n ) 

then as long as c yf 0 the set p _1 (c) is a n — 1 dimensional submanifold of 


6. Suppose that g : AI N is transverse to a submanifold W C N. For 
another smooth map / : Y — > M show that / rh g _1 (N) if and only if 
(go f) (b W. 

7. Suppose that c : [a, 6] — ► M is a smooth map. Show that given any 
compact subset C C (a, b) and any e > 0 there is an immersion 7 : 
(a, b) M which agrees with c on the set C and such that 

|7(f) — c(t)| < e for all t G (a, b). 



Chapter 5 

Lie Groups I 


5.1 Definitions and Examples 


One approach to geometry is to view geometry as the study of invariance and 
symmetry. In our case we are interested in studying symmetries of differentiable 
manifolds, Riemannian manifolds, symplectic manifolds etc. Now the usual 
way to talk about symmetry in mathematics is by the use of the notion of a 
transformation group. The wonderful thing for us is that the groups that arise 
in the study of manifold symmetries are themselves differentiable (even analytic) 
manifolds. Perhaps the reader should keep in mind the prototypical example 
of the Euclidean motion group and the subgroup of rotations about a point in 
Euclidean space R”. The rotation group is usually represented as the group of 
orthogonal matrices O(n) and then the action on R” is given in the usual way 
as 

x i— > Qx 

for Q € 0(n ) and x a column vector from R n . It is pretty clear that this group 
must play a big role in physical theories. Of course, translation is important 
too and the combination of rotations and translations generate the group of 
Euclidean motions E(n). We may represent a Euclidean motion as a matrix 
using the following trick: If we want to represent the transformation x — > Qx + b 
where x, b £ R n we can achieve this by letting column vectors 


1 

x 


e R n+1 


represent elements x. The we form 

' 1 0 
b Q 

and notice that 


' 1 o' 

' 1 ' 


1 

b Q 

X 


b + Qx 
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Thus we may identify E{ri) with the group of (n + 1) x ( n + 1) matrices of the 
above form where Q £ O(n). Because of the above observation we say that 
E[n) is the semidirect product of 0(n) and R n and write 0(n) X R n . 

Exercise 5.1 Make sense of the statement E(n)/0(n) = R”. 

Lie groups place a big role in classical mechanics (symmetry and conserva- 
tion laws), fluid mechanic where the Lie group may be infinite dimensional and 
especially in particle physics (Gauge theory). In mathematics, Lie groups play a 
prominent role Harmonic analysis (generalized Fourier theory) and group repre- 
sentations as well as in many types of geometry including Riemannian geometry, 
algebraic geometry, Kahler geometry, symplectic geometry and also topology. 

Definition 5.1 A C°° differentiable manifold G is called a Lie group if it is a 
group ( abstract group ) such that the multiplication map g : G x G — > G and the 
inverse map v : G — > G given by g(g, h) = gh and ix(g) = g~ x are C°° functions. 

If G and // are Lie groups then so is the product group G x H where 
multiplication is ( 51 , hf) ■ ( 92 ,^ 2 ) = ( 9 i 92 j ^ 1 ^ 2 )- Also, if H is a subgroup of a 
Lie group G that is also a regular closed submanifold then H is a Lie group itself 
and we refer to H as a (regular) Lie subgroup. It is a nontrivial fact that an 
abstract subgroup of a Lie group that is also a closed subset is automatically a 
Lie subgroup (see 8.4). 

Example 5.1 R is a one- dimensional (abelian Lie group) were the group mul- 
tiplication is addition. Similarly, any vector space, e.g. R", is a Lie group and 
the vector addition. 

Example 5.2 The circle S ' 1 = {z £ C : \z\ 2 = 1} is a 1-dimensional (abelian) 
Lie group under complex multiplication. More generally, the torus groups are 
defined by T n = S 1 x ...x S 1 . 

n-times 

Now we have already introduced the idea of a discrete group action. We are 
also interested in more general groups action. Let us take this opportunity to 
give the relevant definitions. 

Definition 5.2 A left action of a Lie group G on a manifold M is a smooth map 
A : GxM — > M such that X(gi, X(g 2 , to)) = X(g±g 2 , m)) for all 91, 92 G G. Define 
the partial map X g : M — > M by X g (m) = X(g,m) and then the requirement is 
that X : g 1 — > X g is a group homomorphism G — > Diff(M). We often write 
X (< 7 , m) as g ■ to. 

Example 5.3 (The General Linear Group) The set of all non-singidar ma- 
trices GL(n, R) under multiplication. 


Example 5.4 (The Orthogonal Group) The set of all orthogonal matrices 
0(n,R) under multiplication. Recall that an orthogonal matrix A is one for 
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which AA l = I. Equivalently, an orthogonal matrix is one which preserves the 
standard inner product on R n : 


Ax ■ Ay = x ■ y. 

More generally, let V, b = (., .) be any n- dimensional inner product space (b is 
just required to be nondegenerate) and let k be the maximum of the dimensions 
of subspaces on which b is positive definite. Then define 

Ok,n-k(y ,b) = {A £ GL(V) : ( Av,Aw ) = ( v,w ) for all v,w £ V}. 

Ofc,n-fc(V ,b) turns out to be a Lie group. As an example, take inner product on 
R” defined by 

k n 

{x, y) ■= x%y% ~ x ' yl 

i= 1 i=l+l 

and denote the corresponding group by Ok, n -k(M n )- We can identify this group 
with the group Oik , n — k) of all matrices A such that AAA * = A where 

A = diag{ 1, -1, ... ). 

k times n—k times 

0(1, 3) is called the Lorentz group. If we further require that the elements of our 
group have determinant equal to one we get the groups denoted SO(k,n — k). 

Exercise 5.2 Show that if 0 < k < n then SO(k,n — k) has two connected 
components. 

Example 5.5 (The Special Orthogonal Group) The set of all orthogonal 
matrices having determinant equal to one SO(n , R) under multiplication. This 
is the k = n case of the previous example. As a special case we have the rotation 
group £0(3, R) =£0(3). 

Example 5.6 (The Unitary Group) U( 2) is the group of all 2x2 complex 
matrices A such that A A* = I. 

Example 5.7 (The Special Unitary Group) SU( 2) is the group of all 2x2 

complex matrices A such that A A* = I and det(A) = 1. There is a special 
relation between £0(3) and £17(2) that is important in quantum physics related 
to the notion of spin. We will explore this in detail later in the book. 

Notation 5.1 A* := A * 

Example 5.8 The set of all A £ GL(n, R) with determinant equal to one is 
called the special linear group and is denoted £L(n,R). 

Example 5.9 Consider the matrix 2 n x 2 n matrix 
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J defines a skew-symmetric bilinear u > o form on R 2n by ujq(v,w) = v t Jw. The 
set of all matrices in M 2nx2n that preserve this bilinear form is called the sym- 
plectic group Sp(n, R) and can be defined also as 

Sp(n,H) = {S € M 2nX 2 n ; S t JS = J. 

This group is important in Hamiltonian mechanics. If e i, ..., e n , ..., e 2n is a the 
standard basis ofM. 2n then u > o = Yffi—i e * A ei+ n . We will see that in some basis 
every nondegenerate 2-form on R 2 " takes this form. 

Exercise 5.3 Show that SU(2) is simply connected. 

Exercise 5.4 Let g £ G (a Lie group). Show that each of the following maps 
G — > G is a diffeomorphism: 

1) L g : x i— > gx (left translation) 

2) R g : x i— > xg (right translation) 

3) C g : x i— > gxg~ l (conjugation) . 

4) inv : x i— > x _1 (inversion) 

5.2 Lie Group Homomorphisms 

The following definition is manifestly natural: 

Definition 5.3 A smooth map f : G — > H is called a Lie group homomor- 
phism if 


/ ( 51 . 92 ) = f(gi)f(92) for all 51, 52 € G and 
fig -1 ) = f(g) -1 for all g GG. 

and an isomorphism in case it has an inverse which is also a Lie group homo- 
morphism. A Lie group isomorphism G — » G is called a Lie group automor- 
phism. 


Example 5.10 The inclusion SO(n,M.) Gl{n,M) is a Lie group homomor- 
phism. 


Example 5.11 

and the map 


The circle S 1 C C is a Lie group under complex multiplication 


cos (9) sin(0) 0 
— sin(5) cos(5) 0 
0 0 1 


is a Lie group homomorphism into SO(n). 


Example 5.12 The conjugation map C g : G — > G is a Lie group automor- 
phism. 
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Exercise 5.5 (*) Show that the multiplication map g : GxG — > G has tangent 
map at the identity (e, e) £ G x G given as Ti e ^g(v,w) = v + w. Recall that 
we identify Tg,^(G x G ) with T e G x T e G. 

Exercise 5.6 Show that Gl(n,M) is an open subset of the vector space of all 
n x n matrices and that it is a Lie group. Using the natural identification of 
T e Gl(n,M) with M nx „(R) show that as a map M nxn (R) — > M nxn (R) we have 

T e C g : x i-> gxg~ x 


where g £ Gl(n,M) and x £ M nxn (R). 

Example 5.13 The map t i— > e lt is a Lie group homomorphism from R to 
S 1 C C. 

Definition 5.4 A closed (Lie) subgroup H of a Lie group G is a closed 
regidar submanifold that is also a Lie group with respect to the submanifold 
differentiable structure. 

Example 5.14 S 1 embedded as S 1 x {1} in the torus S 1 x S 1 is a subgroup. 

Example 5.15 SO(n) is a subgroup of both 0{n ) and GL(n). 

Definition 5.5 An immersed subgroup of a Lie group G is a smooth injec- 
tive immersion (or it’s image) which is also a Lie group homomorphism. 

Remark 5.1 It is an unfortunate fact that in this setting a map is referred to 
as a “submanifold”. This is a long standing tradition but we will avoid this 
terminology as much as possible preferring to call maps like those in the last 
definition simply injective homomorphisms. 

Definition 5.6 A homomorphism from the additive group R into a Lie group 
is called a one-parameter subgroup. 

Example 5.16 The torus S 1 x S 1 is a Lie group under multiplication given 
by (e lTl , e tSl )(e lT2 , e* 02 ) = (e^ Tl+T2 \ e*^ 1+e2 ^). Every homomorphism o/R into 
S 1 x S 1 , that is, ever one parameter subgroup of S 1 x S' 1 is of the form t i— > 
( e tal , e tbl ) for some pair of real numbers a,b £ R. 

Example 5.17 The map R : R — >■ SO( 3) given by 

( cos 6 — sin 6 0 \ 

sin 9 cos 9 0 ] 

0 0 1 / 


is a one parameter subgroup, (an hence a homomorphism). 
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Example 5.18 Given an element g of the group 517(2) we define the map Ad g : 
su( 2 ) — > su( 2 ) by Ad g : x <— > gxg -1 . Now the skew-Hermitian matrices of zero 
trace can be identified with R 3 by using the following matrices as a basis: 

0 -i W 0 -1 W -i 0 \ 

-i o y’Vi o y’Vo i )~ 

These are just —i times the Pauli matrices oi, 02,03 1 and so the correspon- 
dence su(2) — > R 3 is —xio\ — yia -2 — iza 3 1 — > (x, y , z) ■ Under this correspondence 
the inner product on R 3 becomes the inner product (A, B) = trace(AB). But 
then 


(Ad s A, Ad g B) = trace(gAgg x Bg 3 ) 

= trac e(AB) = (A,B) 

so actually Acl ff can be thought of as an element of 0(3). More is true; Ad s acts 
as an element of SO( 3) and the map g \-^Ad g is then a homomorphism from 
SU(2) to SO{s u(2)) = 50(3). 

The set of all Lie groups together with Lie group homomorphisms forms an 
important category which has much of the same structure as the category of all 
groups. The notions of kernel, image and quotient show up as expected. 

Theorem 5.1 Let h : G — > H be a Lie group homomorphism. The subgroups 
Ker(h) C G, Img(h) C H are Lie subgroups. 


1 check this 
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Fiber Bundles and Vector 
Bundles I 

First of all lets get the basic idea down. Think about this: A function defined 
on the circle with range in the interval (0, 1) can be thought of in terms of its 
graph. The latter is a subset, a cross section, of the product S 1 x (0,1). Now, 
what would a similar cross section of the Mobius band signify? This can’t be the 
same thing as before since a continuous cross section of the Mobius band would 
have to cross the center and this need not be so for S 1 x (0,1). Such a cross 
section would have to be a sort of twisted function. The Mobius band (with 
projection onto its center line) provides us with our first nontrivial example of 
a fiber bundle. The cylinder S 1 x (0, 1) with projection onto one the factors 
is a trivial example. Projection onto S 1 gives a topological line bundle while 
projection onto the interval is a circle bundle. Often what we call the Mobius 
band will be the slightly different object which is a twisted version of S 1 x M 1 . 
Namely, the space obtained by identifying one edge of [0, 1] x R 1 with the other 
but with a twist. 

A fiber bundle is to be though of as a bundle of -or parameterized family of- 
spaces E x = 7r _1 (;r) C E called the fibers. Nice bundles have further properties 
that we shall usually assume without explicit mention. The first one is simply 
that the spaces are Hausdorff and paracompact. The second one is called local 
triviality. In order to describe this we need a notion of a bundle map and the 
ensuing notion of equivalence of fiber bundles. 

Most of what we do here will work either for the general topological category 
or for the smooth category so we once again employ the conventions of 2.6.1. 

Definition 6.1 A general C r - bundle is a triple £ = (E,n,X) where n : E — > 
M is a surjective C r -map of C r -spaces (called the bundle projection). For each 
p £ X the subspace E p := 7r _1 (p)is called the fiber over p. The space E is called 
the total space and X is the base space. If S C X is a subspace we can always 
form the restricted bundle (Es,ns, S) where Es = 7r _1 (S l ) and ns = 7ij s is the 
restriction. 
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Definition 6.2 A (C r — ) section of a general bundle tte : E — » M is a (C r —) 
map s : M — > E such that tte ° s = Mm ■ In other words, the following diagram 
must commute: 

s E 

/ I tte 
M -► M 
id 


The set of all C r —sections of a general bundle tte : E — > M is denoted by 
T k (M, E). We also define the notion of a section over an open set U in At is 
the obvious way and these are denoted by T k (U,E). 

Notation 6.1 We shall often abbreviate to just T(U, E) or even T(E) whenever 
confusion is unlikely. This is especially true in case k = oo (smooth case) or 
k = 0 (continuous case). 

Now there are two different ways to treat bundles as a category: 

The Category Bun. 

Actually, we should define the Categories Bunk ;fc = 0, 1, ...., oo and then ab- 
breviate to just “Bun” in cases where a context has been establish and confusion 
is unlikely. The objects of Bunk are C r — fiber bundles. 

Definition 6.3 A morphism from HoniB ttni , (£i, £ 2 ,), also called a bundle map 
from a C r — fiber bundle £1 := Xf) to another fiber bundle £ 2 := ( E 2 , n 2 , X 2 ) 

is a pair of C r —maps (/, /) such that the following diagram commutes: 

Ei E 2 

I i 

Xi 4 x 2 

If both maps are C r -isomorphisms we call the map a (C r -) bundle isomorphism. 

Definition 6.4 Two fiber bundles £1 := {Ei,tt\,Xi) and (2 '■= (£' 2 ) 7r 2 iX 2 ) are 
equivalent in Bunk or isomorphic if there exists a bundle isomorphism from 
Ci to C 2 ■ 

Definition 6.5 Two fiber bundles £1 := (Ei,tti,Xi) and £ 2 := (f? 2 , 7r 2 , X 2 ) are 
said to be locally equivalent if for any y £ E\ there is an open set U containing 
p and a bundle equivalence (/, /) of the restricted bundles: 

Ei\U 4 E 2 \f(U) 

I I 

U 4 f(U) 


The Category Bunk(X) 
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Definition 6.6 A morphism from also called a bundle 

map over X from a C r — fiber bundle £1 := X) to another fiber bundle 

£2 := {E 2 ,tt 2 , X) is a C r —map f such that the following diagram commutes: 

E\ -4 E 2 

\ / 

X 


If both maps are C r -isomorphisms we call the map a (C r -) bundle isomor- 
phism, over X (also called a bundle equivalence) . 

Definition 6.7 Two fiber bundles £1 := (Ei,iri, X) and £2 := {E 2 ,n 2 ,X) are 

equivalent in Bunk(X) or isomorphic if there exists a (C r -) bundle isomor- 

phism over X from £1 to f 2 ■ 

By now the reader is no doubt tired of the repetitive use of the index C r so 
from now on we will simple refer to space (or manifolds) and maps where the 
appropriate smoothness C r will not be explicitly stated unless something only 
works for a specific value of r. 

Definition 6.8 Two fiber bundles £1 := (Ei,tti, X) and f 2 := (E 2 ,n 2 ,X) are 
said to be locally equivalent (over X) if for any y € E\ there is an open set 
U containing p and a bundle equivalence (/, /) of the restricted bundles: 

Ei \ U -4 E 2 \U 

i 1 

u H u 

Now for any space X the trivial bundle with fiber F is the triple (X x 
F,pr i,X) where pr\ always denoted the projection onto the first factor. Any 
bundle over X that is bundle equivalent to X x F is referred to as a trivial 
bundle. We will now add in an extra condition that we will usually need: 

Definition 6.9 A (— ) fiber bundle ( := (E,tt,X) is said to be locally trivial 
(with fiber F) if every for every x £ X has an open neighborhood U such that 
£u := (Ejj,ttu,U) is isomorphic to the trivial bundle (U x F,pr\,U). Such a 
fiber bundle is called a locally trivial fiber bundle. 

We immediately make the following convention: All fiber bundles in the 
book will be assumed to be locally trivial unless otherwise stated. 

Once we have the local triviality it follows that each fiber E p = 7r _1 (p) is 
homeomorphic (in fact, — diffeomorphic) to F. 

Notation 6.2 We shall take the liberty of using a variety of notions when talk- 
ing about bundles most of which are quite common and so the reader may as well 
get used to them. One, perhaps less common, notation which is very suggestive 
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is writing £ := (F E —>■ X ) to refer to a fiber bundle with typical fiber F. 
The notation suggests that F may be embedded into E as one of the fibers. This 
embedding is not canonical in general. 

It follows from the definition that there is a cover of E by bundle charts 
or (local) trivializing maps f a : Ejj a — > U a x F such that 

E Ua ^ U a x F 

7T \ »/ 

u a 

which in turn means that the so called overlap maps f a offi l :U a OlJxF—> 
U a D U x F must have the form f a o f^ 1 (x, u) = (x, fp a ,x(u)) for maps x — > 
f/3a,x G Dif f r {F) defined on each nonempty intersection U a fl U. These are 
called transition functions and we will assume them to be —maps whenever 
there is enough structure around to make sense of the notion. Such a cover by 
bundle charts is called a bundle atlas. 

Definition 6.10 It may be that there exists a C r -group G ( a Lie group in 
the smooth case) and a representation p of G in Dif f r (F ) such that for each 
nonempty U a CUp we have fp a , x = p{.9ap{x)) for some C r -map g a p : U a C\Up — » 
G. In this case we say that G serves as a structure group for the bundle via 
the representation p. In case the representation is a faithful one then we may 
take G to be a subgroup of Dif f r (F) and then we simply have fp a ,x = gap{x). 
Alternatively, we may speak in terms of group actions so that G acts on F by 
— diffeomorphisms. 

The maps g a p : U a fl Up —> G must satisfy certain consistency relations: 

g aa (x ) = id for x £ U a 

g a p{x)gp a (x) = id for x G U a (~1 Up (6.1) 

9<xp{x)g^(x)g ia (x) = id for x £ U a n Up (1 U 1 . 

A system of maps g a p satisfying these relations is called a cocycle for the cover 
{U Q }. 

Definition 6.11 A fiber bundle £ := (F E —>■ X) together with a G action 
on F is called a G-bundle if there exists a bundle atlas {(U a , /„)} for £ such 
that the overlap maps have the form f a o f)f 1 (x,y) = (x,g a p(x) ■ y)) cocycle 
{g a p} for the cover {U a }. 

Theorem 6.1 Let G have C r — action on F and suppose we are given cover of 
a C r -space M and cocycle {gap} for a some cover {U a }a(sA- Then there exists 
a G-bundle with an atlas {( U a ,f a )} satisfying f a o / / ^ 1 (a:,j/) = (x,g a p( x) ■ y )) 
on the overlaps. 
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Proof. On the union £ := (J {a} x U a x F define an equivalence relation 
such that 


( a,u,v ) G {a} x U a x F 


is equivalent to (/?, x, y) G {/3} x Up x F iff u = x and v = g a p(x) ■ y. 

The total space of our bundle is then E := £/ ~. The set £ is essentially 
the disjoint union of the product spaces U a x F and so has an obvious topology. 
We then give E := £/ ~ the quotient topology. The bundle projection tt e is 
induced by (a,u,v) e- > u. Notice that 7Tg (£/„) To get our trivializations we 
define 

f a (e) := (u,v) for e G 7 r^ 1 (t/ Q ) 


where (it, v) is the unique member of U a x F such that (a, u, v ) G e. The 
point here is that (a,u\,vi) ~ (a, 112,1*2) only if = ( 1 ( 2 , V2) ■ Now 

suppose U a CiUp ^ 0. Then for x G U a nUp the element fp{x,y) is in 
^{Ua n Up) = 7 T^(U a ) n Kb 1 {Up) and so /^{xpy) = [{(3,x,y)\ = [(a,u,t))]. 

Which means that x = u and v = g a p(x) • y. From this it is not hard to see 
that 

f a 0 fp\x,y) = (. x,g a p(x ) • y)). 

We leave the question of the regularity of the maps and the C r structure to the 
reader. ■ 

An important tool in the study of fiber bundles is the notion of a pullback 
bundle. We shall see that construction time and time again. Let £ = (F 
E —> M) be a -fiber bundle and suppose we have a -map / : X — > M. We want 
to define a fiber bundle /*£ = {F f*E — > X). As a set we have 


f*E = {{x, e) G X x E : f(x) = 7r(e)}. 


The projection f*E — » X is the obvious one: (a;, e) 1 — > x G N. 


Exercise 6.1 Exhibit fiber bundle charts for f*E. 

If a cocycle {gap} for a some cover {U a }a^A determine a bundle £ = {F 
E M) and / : X — > M as above, then {g a p°f} = {f*9ap} is a cocycle for the 
same cover and the bundle determined by this cocycle is (up to isomorphism) 
none other than the pullback bundle /*£. 

{gap} ^ ? 

{f*9ap} - r? 

The verification of this is an exercise that is easy but constitutes important 
experience so the reader should not skip the next exercise: 


Exercise 6.2 Verify that above claim. 

Exercise 6.3 Show that if A C M is a subspace of the base space of a bundle 
£ = (F c — > E — > M) and 1 : A c — > M then is naturally isomorphic to the 

restricted bundle = {F Ea A). 
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An important class of fiber bundles often studied on their own is the vector 
bundles. Roughly, a vector bundle is a fiber bundle with fibers being vector 
spaces. More precisely, we make the following definition: 

Definition 6.12 A real (or complex) (— ) vector bundle is a (— ) fiber bundle 
(E,n,X) such that 


(i) Each fiber E x := ir 1 (x) has the structure of a real (resp. complex) vector 
space. 

(ii) There exists a cover by bundle charts (U a , f a ) such that each restriction 
f a \ E is a real (resp. complex) vector space isomorphism. We call these 

vector bundle charts or VB-charts. . 

The set of all vector bundles is a category Vect. Once again we need to 
specify the appropriate morphisms in this category and the correct choice should 
be obvious. A vector bundle morphism ( or vector bundle map) between 
£i := (Ei, 7Ti, Xf] and ( 2 := (A 2 ,7r 2 , X 2 ) is a bundle map (/,/) : 

E x -4 E 2 
I I 
X! 4 X 2 

which is linear on fibers. That is / is a linear map from tt( 1 (x) into 

the fiber Ttf l (x). We also have the category Vect(X) consisting of all vector 
bundles over the fixed space X. Here the morphisms are bundle maps of the 
form ( F,idx )• Two vector bundles := (Ei,m,X) and ( 2 := (E 2 ,tt 2 ,X) over 
the same space X are isomorphic (over X) if there is a bundle isomorphism 
over X from to ( 2 which is a linear isomorphisms when restricted to each 
fiber. Such a map is called a vector bundle isomorphism. 

In the case of vector bundles the transition maps are given by a representa- 
tion of a Lie group G as a subgroup of Gl(n, F). More precisely, if ( = (F fc 
E M) there is a Lie group homomorphism p : G — » Gl(k, F) such that for 
some VB-atlas {U a , f a } we have the overlap maps 

/a o fp 1 : u a n Up X F fc - u a n Up x F fe 

are given by f a of)f 1 (x, v) = (x, p(g a p(x))v) for a cocycle {g a p}. In a great many 
cases, the representation is faithful and we may as well assume that G C Gl(k , F) 
and that the representation is the standard one given by matrix multiplication 
v i— > gv. On the other hand we cannot restrict ourselves to this case because 
of our interest in the phenomenon of spin. A simple observation that gives a 
hint of what we are talking about is that if G C G(n, R) acts on R fc by matrix 
multiplication and h : G — > G is a covering group homomorphism ( or any Lie 
group homomorphism) then v i— > g ■ v := h(g)v is also action. Put another way, 
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if we define ph : G — > G(n,R) by ph(g) = h(g)v then ph is representation of 
G. The reason we care about this seemingly trivial fact only becomes apparent 
when we try to globalize this type of lifting as well will see when we study 
spin structures later on. To summarize this point we may say that whenever we 
have a VB-atlas {U a , f a } we have the transition functions {f a p} = {x h- > fp a ,x} 
which are given straight from the overlap maps f a of( ! 1 (x, u) by f a °fp 1 { x i u ) = 
( x , fj 3a,x( u ))- Of course, the transition functions {f a p} certainly form a cocycle 
for the cover {U a } but there may be cases when we want a (not necessarily 
faithful) representation p : G — > Gl(k, F”) of some group G not necessarily a 
subgroup of Gl(k, F") together with some G— valued cocycle {g a p} such that 
fpa,x(u) = p(g a p(x)). Actually, we may first have to replace {U a , f a } by a 
“refinement” ; a notion we now define: 


Definition 6.13 A refinement of a VB-atlas { U a , f a } is a VB-atlas {V a >, 
such that for every a ' £ A' there is a {U a , f a } such that U a C V a > and 

ha' | u — fa ■ 

We consider the refined atlas to be equivalent to the original. In fact, there 
is more to the notion of equivalence as the next result shows. For this theorem 
we comment that any two G r -compatible atlases on a vector bundle have a 
common refinement (G r -compatible) which the first two). 


Theorem 6.2 Let £i = (F fc Ei — > M) and £2 = (F fe £2 M) be 
two vector bundles over M. Let {/a/3} and {h af 3} be the respective transition 
maps (which we assume to be over the same cover). Then £ 2 is vector bundle 
isomorphic to £1 if and only if there exist functions A a : U a —> Gl(k, F") such 
that 

f a@ — Aa/la^A^ 


whenever U a fl Up. 


Example 6.1 (Canonical line bundle) Recall that P"(R) is the set of all 

lines through the origin in R" +1 . Define the subset L(P”(R)) of P n (M.) x R" +1 
consisting of all pairs ( l,v ) such that v £ l (think about this). This set together 
with the map 7rpn( R ) : (l,v) l is a rank one vector bundle. The bundle charts 
are of the form 7TpA R , (Pj)^^ where tpi ■ 1— > (ipi(l),pr.i(v)) £ R" x R. 


Example 6.2 (Tautological Bundle) Let G(n. k) denote the Grassmannian 
manifold of k-planes in R™. Let be the subset of G(n,k) x R" consisting of 
pairs (P,v) where P is a k-plane (k- dimensional subspace) and v is a vector in 
the plane P. The projection of the bundle is simply (P,v) 1— > P. We leave it to 
the reader to discover an appropriate VB-atlas. 


Note well that these vector bundles are not just trivial bundles and in fact 
their topology (for large n) is of the up most importance for the topology of 
other vector bundles. One may take the inclusions ...R n C R n+1 C ... C R°° to 
construct inclusions ...G(n, k) C G(n + 1, fc)...and ...7 C 7n+i,fe--from which a 
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“universal bundle” 7 „ — •> G(n) is constructed with the property that every rank 
k vector bundle E over X is the pull back by some map / : X — > G(n ) : 

E = f* 7n ->■ In 

I I 

X -4 G(n) 

6.1 Transitions Maps and Structure 

Now we come to an important point. Suppose we have a (locally trivial) fiber 
bundle £ := (E, 7 r, X).We have seen that we can find a covering of the base space 
X by open sets U a with corresponding trivializing maps f a : Ejj a — > U a x F. 
We have also noted that whenever U a D Up yf 0 we have, for every point x £ 
U a n Up. transition maps f a ,p,x '■ F —> F. These may be may be seen to arise 
through the following composition: 

y (x,y) ^ f a ° U\ ~ E ] (x,y) = {x,f a p tX {y)) ^ f a p, x (y) 

Thus f a ,/ 3 ,x is essentially the restriction of f a o /j) 1 to {x} x F. If the maps 
f a j 3 : U a fl Up — > Diff r (F ) given by x f a ,/ 3 ,x often take values in some 
subgroup G C Dif f r (F) which exists because the fiber F has some structure. 
For example it might be a finite dimensional F- vector space (say F fc ) and then 
we hope to find a cover by trivializations ( f a ,U a ) resulting in transition maps 
taking values in Gl(k, F). 

Exercise 6.4 Show that if we have this situation; f a> p : U a D Up — ► Gl(k, F) 
whenever U a fl Up ^ 0 then there is defined a natural vector space structure on 
each fiber E x and that we have a vector bundle as defined above in 6.12. 

As we indicated above, for a smooth (or C r , r > 0) vector bundle require 
that all the maps are smooth (or C r , r > 0) and in particular we require that 
/a/3 : U a fl U /3 — » GL(k, F) are all smooth. 

Exercise 6.5 To each point on a sphere attach the space of all vectors normal 
to the sphere at that point. Show that this normal bundle is in fact a (smooth) 
vector bundle. Also, in anticipation of the next section, do the same for the 
union of tangent planes to the sphere. 

Exercise 6.6 Let Y = R x R. and define let ( 27 , 2 / 1 ) ~ ( 27 , 2 / 2 ) if and only if 
X\ = 27 + jk and 2/1 = (— l) :,fc 2/2 for some integer k. Show that E := Y / ~ 
is a vector bundle of rank 1 which is trivial if and only if j is even. Convince 
yourself that this is the Mobius band when j is odd. 

6.2 Useful ways to think about vector bundles 

It is often useful to think about vector bundles as a family of vector spaces 
parameterized by some topological space. So we just have a continuously varying 
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Figure 6.1: Circle bundle. Schematic for fiber bundle. 


family V x : x £ X. The fun comes when we explore the consequences of how 
these V x fit together in a topological sense. We will study the idea of a tangent 
bundle below but for the case of a surface such as a sphere we can define the 
tangent bundle rather directly. For example, we may let TS 2 = {(x,v) : x £ S 2 
and (a:, v) = 0}. Thus the tangent vectors are pairs (x, v) where the x just tells 
us which point on the sphere we are at. The topology we put on TS 2 is the 
one we get by thinking of TS 2 as a subset of all pair, namely as a subset of 
R 3 xl 3 . The projection map is tt : (x,v) i— > x. 

Now there is another way we can associate a two dimensional vector space 
to each point of S 2 . For this just associate a fixed copy of R 2 to each a; £ S' 2 by 
taking the product space S 2 x R 2 . This too is just a set of pairs but the topology 
is the product topology. Could it be that S 2 x R 2 — > S 2 and T S 2 are equivalent 
as vector bundles? The answer is no. The reason is that S 2 x R 2 — » S 2 is the 
trivial bundle and so have a nowhere zero section: 

x e- > (x, v) 

where v is any nonzero vector in R 2 . On the other hand it can be shown using 
the techniques of algebraic topology that T S 2 does not support a nowhere zero 
section. It follows that there can be no bundle isomorphism between the tangent 
bundle of the sphere n : TS 2 — * S 2 and the trivial bundle pr\ : S 2 x R 2 -t S 2 . 

Recall that our original definition of a vector bundle (of rank k) did not 
include explicit reference to transition functions but they exist nonetheless. 



96 


CHAPTER 6. FIBER BUNDLES AND VECTOR BUNDLES I 


Accordingly, another way to picture a vector bundle is to think about it as 
a bunch of trivial bundles U a x R fc (or U a x C fc for a complex bundle)) to- 
gether with “change of coordinate” maps: U a x l fc — > Up x given by 
(x,v) i— > (x, g a f}(x)v). So a point (vector) in a vector bundle from a's view is 
a pair (x, v) while if U a fl Up 0 then from P's viewpoint it is the pair (x, w) 
where w = g a f){x)v. Now if this really does describe a vector bundle then the 
maps g a p : U a fl Up — > G7(fc, R) (or Gl(k, C)) must satisfy certain consistency 
relations: 


gaa(x) = id for x £ U a 
g<xp{x)gpoc{x) = id for X £U a r\Uf 3 
g a p(x)g 0J (x)g ia (x) = id for x G U a n Up n U 1 . 


( 6 . 2 ) 


A system of maps g a p satisfying these relations is called a cocycle for the bundle 
atlas. 


Before 


After 


Example 6.3 Here is how the tangent bundle of the sphere looks in this picture: 
Let U XVt+ = {( x,y,z ) G S 2 : z > 0} 
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Then for example g y + tZ +(x, y, z) is the matrix 

( , ii x \ 

V ~y 2 w v \/ x -x 2 -y 2 ) 

= ( 1 0 ) 

\ ~x/z - y/z ) 

the rest are easily calculated using the Jacobian of the change of coordinate maps. 

Or using just two open sets coming from stereographic projection: let U\ 
be the open subset {( x,y,z ) G S 2 : z ^ —1} and let U 2 be the open subset 
{( x,y,z ) G S' 2 : z 7 ^ 1}. Now we take the two trivial bundles U\ x R 2 and 
U 2 x R 2 and then describe < 712 (/?) for p = (x, y, z) G U\ fl U 2 . 

gi2{p)-{v 1 ,V 2 ) t = {w 1 ,w 2 ) t 

where h\ 2 (p) is the matrix 


f ~{x 2 - y 2 ) -2 xy \ 

V ~ 2x y x 2 - y 2 ) ' 

This last matrix is not so easy to recognize. Can you see where I got it? 

Theorem 6.3 Let M be a C r — space and suppose there is given an open cover 
{[/ojaei of M together with C r —maps g a p : U a fl Up — > GL(F fc ) satisfying the 
cocycle relations. Then there is a vector bundle tte '■ E — » M that has these as 
transition maps. 


6.3 Sections of a Vector Bundle 

It is a general fact that if {V x } xe x is any family of F— vector spaces and !F{X, F) 
is the vector space of all F-valued functions on the index set X then the set of 
maps (sections) of the form a : x 1 — > a(x) G 14 is a module over T(X. F) the 
operation being the obvious one. For us, the index set is some C r — space and 
the family has the structure of a vector bundle. Let us restrict attention to C°° 
bundles. The space of sections T(£) of a bundle f = (F fe °-s- E — > M) is a 
vector space over F but also a over module over the ring of smooth functions 
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C°°(M). The scalar multiplication is the obvious one: (fa)(x) := f(x)a(x). 
Of course we also have local sections a : U — > E defined only on some open 
set. Piecing together local sections into global sections is whole topic in itself 
and leads to several topological constructions and in particular sheaf theory and 
sheaf cohomology. The first thing to observe about global sections of a vector 
bundle is that we always have plenty of them. But as we have seen, it is a 
different matter entirely if we are asking for global sections that never vanish; 
that is, sections a for which a(x ) is never the zero element of the fiber E x . 

6.4 Sheaves, Germs and Jets 

In this section we introduce some formalism that will not only provide some 
convenient language but also provides conceptual tools. The objects we intro- 
duce here form an whole area of mathematics and may be put to uses far more 
sophisticated than we do here. Some professional mathematicians may find it 
vaguely offensive that we introduce these concepts without an absolute neces- 
sity for some specific and central problems. However, the conceptual usefulness 
of the ideas go a long way toward helping us organize our thoughts on several 
issues. So we persist. 

There is a interplay in geometry and topology between local and global data. 
To see what the various meanings of the word local might be let consider a map 
F : C°°(M ) — > C°°(M) which is not necessarily linear. For any / £ C°°(M ) we 
have a function F(f) and its value (F(f))(p) £ R at some point p € M. Let us 
consider in turn the following situations: 

1. It just might be the case that whenever / and g agree on some neighbor- 
hood of p then (F(/))(p) = (P(g)){p)- So all that matters for determining 
( F(f))(p ) is the behavior of / in any arbitrarily small open set containing 
p. To describe this we say that ( F(f))(j> ) only depends on the “germ” of 
/ at p. 

2. Certainly if / and g agree on some neighborhood of p then they have the 
same Taylor expansion at p. The reverse is not true however. Suppose 
that whenever two functions / and g have Taylor series which agree up to 
and including terms of order |x| fc then (F(f)){p) = (P{g)){p)- Then we 
say that (F(/))(p) depends only on the k— jet of / at p. 

3. If ( F(f))(p ) = (F(g))(p) whenever df | = dg | we have a special case of 
the previous situation since df | = dg | exactly when / and g have Taylor 
series which agree up to and including terms of order 1. So we are talking 
about the “1— jet”. 

4. Finally, it might be the case that ( F(f))(p ) = (F(g))(p) exactly when p. 

Of course it is also possible that none of the above hold at any point. Notice 
as we go down the list we are saying that the information needed to deter- 
mine (F(f))(p) is becoming more and more local; even to the point of being 
infinitesimal. 
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Here is a simple example. Let F : C°°(K) — » C°°(K) be defined by F(f) := 
/'. Then it is pretty clear that F(f)(t) depends only on the 1-jet at t (for any 
t). 

Exercise 6.7 Let ( a tJ )l<i,j<n be a matrix of smooth functions on K" and also 
let b 1 , ...b n ,a be smooth functions on R n . Observe that the map L : C°°(]R n ) — > 
C 00 (K”) by 

L(/)= ^ a b^!/ +^ 6 i|4 + a 

^ dx'dx'- 1 ^ dx l 

1,3 * 

is such that for any x £ R" the value L(f)(x) depends only on the 2— jet of f at 
x. Show that any linear map L : C'°°(R") — > C°°(R") which has this property 
must have the above form. 

Now let us consider a section a : M — > E of a vector bundle E. Given any 
open set U C M we may always produce the restricted section a\ u : U — > E. 
This gives us a family of sections; on for each open set U. To reverse the 
situation, suppose that we have a family of sections ajj : U — > E where U varies 
over the open sets (or just a cover of M). When is it the case that such a family 
is just the family of restrictions of some (global) section a : M —> El This is 
one of the basic questions of sheaf theory. . 

Definition 6.14 A presheaf of abelian groups (resp. rings etc.) on a 

manifold (or more generally a topological space M is an assignment M. (U) to 
each open set U C M together with a family of abelian group (resp. ring etc.) 
homomorphisms ry : M.(U) — * A4(V) for each nested pair V C U of open sets 
and such that 

Presheaf 1 o r\) = whenever W C V C U. 

Presheaf 2 ry = idy for all open V C M. 

Definition 6.15 Let A4 be a presheaf and 1Z a presheaf of rings. If for each 
open U C M we have that A4(U) is a module over the ring JZ{U) and if the 
multiplication map 1Z(U) x M(U) — » M{U) commutes with the restriction maps 
r^y then we say that A4 is a presheaf of modules over 1Z. 

The best and most important example of a presheaf is the assignment U i— > 
T(E,U) for some vector bundle E —> M and where by definition ry( s ) = s lu 
for s £ T(E,U). In other words ry is just the restriction map. Let us denote 
this presheaf by Se so that Se(U) = T(E, U). 

Definition 6.16 A presheaf homomorphism h : M.\ —> M .2 is an is an as- 
signment to each open set U C M an abelian group (resp. ring, module, etc.) 
morphism hu : A4\{U) — » M.i (U) such that whenever V C U then the following 
diagram commutes: 

M^U) h -H M 2 (U) 
ry I Ty I 

Mi(V) ^ M 2 (V) 
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Note we have used the same notation for the restriction maps of both presheaves. 

Definition 6.17 We will call a presheaf a sheaf if the following properties 
hold whenever U = (J^ U a for some collection of open sets U. 

Sheaf 1 If si, S 2 G A4(U) and r^ s\ = r E s 2 for all U a GU then si = s 2 . 

Sheaf 2 If s a G A4(U a ) and whenever U a D Up ^ 0 we have 

r U a nU /3 S a = r U a nU 0 S l3 

then there exists a s G A4 (U) such that r E s = s a . 

If we need to indicate the space M involved we will write A 4m instead of 

M. 

It is easy to see that all of the following examples are sheaves. In each case 
the maps are just the restriction maps. 

Example 6.4 (Sheaf of smooth functions) Cf^(U) = C°°(U). Notice that 
Cm is a sheaf of modules over itself. Also C^fU) is sometimes denoted by 
Fm(U). 

Example 6.5 (Sheaf of holomorphic functions ) Sheaf theory really shows 
its strength in complex analysis. This example is one of the most studied. How- 
ever, we have not yet defined the notion of a complex manifold and so this 
example is for those readers with some exposure to complex manifolds. Let M 
be a complex manifold and let Om(U) be the algebra of holomorphic functions 
defined on U. Here too, Om is a sheaf of modules over itself. Where as the sheaf 
0^ always has global sections, the same is not true for Om ■ The sheaf theoretic 
approach to the study of obstructions to the existence of global sections has been 
very successful. 

Example 6.6 (Sheaf of continuous functions) C(U) = C(U) 

Example 6.7 (Sheaf of smooth sections) Sm(U) = T(U,E) for a vector 
bundle E — > M. Here Sm is a sheaf of modules over Cm- 

Remark 6.1 ( Notational convention !) Even though sheaf theory is a deep 
subject, one of our main purposes for introducing it is for notational (and con- 
ceptual) convenience. Each of the above presheaves is a presheaf of sections. 
As we proceed into the theory of differentiable manifolds we will meet many in- 
stances where theorems stated for sections are for both globally defined sections 
and for sections over opens sets and many times the results will be natural in 
one way or another with respect to restrictions. For this reason we might simply 
write Sff ,C°° etc. whenever we are proving something that would work globally 
or locally in a natural way. In other words, when the reader sees S or Cm he 
or she should think about both S E (U) and S E (M) or both C°°(U) and C°°(M ) 
and so on. 
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We say that Si € S E (U ) and s 2 G S E (V) determine the same germ of 
sections at p if there is an open set W C U n V such that rjySi = r^-s 2 . Now 
on the union 

U s E w 

peu 

we impose the equivalence relation .sq ~ s 2 iff si and s 2 determine the same germ 
of sections at p. The set of equivalence classes (called germs of section at p) a 
ring and is denoted S E . The set S E ((U)) = (J £l7 S. E is called the sheaf of germs 
and can be given a topology so that the projection pr$ :S E ((U)) — > M defined 
by the requirement that pr$([s]) = p iff [s] G S E is a local homeomorphism. 

More generally, let A4 be a presheaf of abelian groups on M. For each p G M 
we define the direct limit group 

M p = lim M(U) 

peu 

with respect to the restriction maps r E . 

Definition 6.18 A4 P is a set of equivalence classes called germs at p. Here 
si G A 4{U) and s 2 £ A4(V) determine the same germ of sections at p if there 
is an open set W C U fl V containing p such that r\ySi = rj^s 2 . The germ of 
s £ A4 (U) at p is denoted s p . 

Now we take to union M = \J peM M p and define a surjection tt : M — » M 
by the requirement that 7r(s p ) = p for s p £ M p . The space M is called the 
sheaf of germs generated by M. We want to topologize A4 so that 7r is 
continuous and a local homeomorphism but first a definition. 

Definition 6.19 (etale space) A topological space Y together with a contin- 
uous surjection n : Y — > M which is a local homeomorphism is called an etale 
space. A local section of an etale space over an open subset U C M is a map 
Sjj '. U — > Y such that tt o sjj = idf/ . The set of all such sections over U is 
denoted T(U, Y). 

Definition 6.20 For each s £ M{U) we can define a map (of sets) s : U — » 
M by 

s(x) = s x 

and we give M. the smallest topology such that the images s(U) for all possible 
U and s are open subsets of M. 

With the above topology M becomes <m etale space and all the sections s 
are continuous open maps. Now if we let A4(U) denote the sections over U for 
this etale space, then the assignment U — > A4 (U) is a presheaf which is always 
a sheaf. 

Proposition 6.1 If M. was a sheaf then M. is isomorphic as a sheaf to A4. 
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The notion of a germ not only makes sense of sections of a bundle but we 
also have the following definition: 

Definition 6.21 Let F p (M,N) be the set of all smooth maps f .: M,p — + N 
which are locally define near p. We will say that two such functions f± and f 2 
have the same germ atp if they are equal on some open subset of the intersection 
of their domains which also contains the point p. In other words they agree in 
some neighborhood of p. This defines an equivalence relation on F p (M,N) and 
the equivalence classes are called germs of maps. The set of all such germs of 
maps from M to N is denoted by F P (M, N). 

It is easy to see that for [fi], [f 2 \ € F p (M, F) where F = R or C, the product 
[,/i] [./s] = [/ 1 / 2 ] is well defined as are the linear operations a[/i] + b[f 2 \ = 
[afi + bf 2 }. One can easily see that F P (M, F) is a commutative ring. In fact, 
it is an algebra over F. Notice also that in any case we have a well defined 
evaluation [f](p) = f(p). 

Exercise 6.8 Reformulate Theorem 26.12 in terms of germs of maps. 


6.5 Jets and Jet bundles 

A map / :: E,x — + F, y is said to have fc-th order contact at x with a map 
gw E, x — > F, y if / and g have the same Taylor polynomial of order k at x. This 
notion defines an equivalence class on the set of (germs) of functions E, x — > F, y 
and the equivalence classes are called fc-jets. The equivalence class of a function 
/ is denoted by j k f and the space of fc-jets of maps E,x — > F,y is denoted 
J k (E,F) y . We have the following disjoint unions: 

J^(E,F)= U J k (E.F) y 

J/6F 

j 1 (e,f) 9 =U4(ef) 9 

x£E 

J fc (E,F)= |J J*( E,F)„ 

rcSE.yGF 


Now for a (germ of a) smooth map / :: M,x — > N, y we say that / has fc-th 
order contact at x with a another map g :: M, x —> N, y il (f>o f oip has fc-th order 
contact at ip(x) with (f>o g o if for some (and hence all) charts <p and if defined 
in a neighborhood of x £ M and y £ N respectively. We can then define sets 
J%(M,N), J k (M, N) y , and J k (M,N). The space J k (M,N) is called the space 
of jets of maps from M to N and turns out to be a smooth vector bundle. In 
fact, a pair of charts as above determines a chart for J k (M, N) defined by 

<t>) ■ Jxf ^ {4>{x),$U(x)),D(4> 0 / 01 )), ..., D k (cj)o f o if)) 
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where Dp(p o f o ip) g L J sym ( K", N). In finite dimensions, the chart looks like 


J k (ip,<p) : j k f ( x 1 ,...,x n ,y l 


8xD ’’ 


d a p 

dx a 


(*)) 


where |a| = k. Notice that the chart has domain [/,/, x U^. 

Exercise 6.9 What vector space is naturally the typical fiber of this vector bun- 
dle? 


Definition 6.22 The k-jet extension of a map f : M — > N is defined by 
j k f : x i— > j k f g J k (M,N). There is a strong transversality theorem that uses 
the concept of a jet space: 

Theorem 6.4 (Strong Transversality) Let S be a closed submanifold of J k (M, N). 
Then the set of maps such that the k-jet extensions j k f are transverse to S is 
an open everywhere dense subset in C°°(M,N). 
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Chapter 7 


Vector Fields and 1-Forms 


7.1 Definition of vector fields and 1-forms 

Definition 7.1 A smooth vector field is a smooth map X : M — > TM such 
that X(p) £ T p M for all p £ M. We often write X(j>) = X p . In other words, a 
vector field on M is a smooth section of the tangent bundle tm '■ TM — » M. 

The map X being smooth is equivalent to the requirement that Xf:M—>M. 
given by p i— > X p f is smooth whenever / : M — > K is smooth. 

If (x, U) is a chart and X a vector field defined on U then the local represen- 
tation of X is x i— > (x, X(x)) where the principal representative (or principal 
part) X is given by projecting TxoX ox~ 1 onto the second factor in T E = E x E: 

x i— > x _1 (x) = p i— ► X(p) i— > Tx ■ X(p) 

= (x(p),X(x(p))) = (x, X(x)) ^ X(x) 

In finite dimensions one can write X(x) = (zq(x), ...,v n (x)). 

Notation 7.1 The set of all smooth vector fields on M is denoted byT(M,TM) 
or by the common notation £(M). Smooth vector fields may at times be defined 
only on some open set so we also have the notation X(U) = Xm{U) for these 
fields. The map U i— > X M (U) is a presheaf (in fact a sheaf). 


A (smooth) section of the cotangent bundle is called a covector field or 
also a smooth 1-form . The set of all C r 1-forms is denoted by X r *(M) with 
the smooth 1-forms denoted by £*(M). 

X* ( M ) is a module over the ring of functions C°°(M) with a similar state- 
ment for the C r case. 

Definition 7.2 Let f : M — > K be a smooth function with r > 1. The map 
df : M — > T*M defined by p i— > df(p) where df(p) is the differential at p as 
defined in 3.3. is a 1-form called the differential of f. 
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7.2 Pull back and push forward of functions and 
1-forms 

If (f> : N —> M is a C r map with r > 1 and / : M — > M a C r function we define 
the pullback of / by </> as 

Pf = f°<t> 

and the pullback of a 1-form a £ X*(M) by <p*a = ao T<f>. To get a clear picture 
of what is going on we could view things at a point and we have <p*a | • v = 

a \<k(v) ‘ ( T p^’ u )- 

The pull-back of a function or 1-form is defined whether <f> : N M happens 
to be a diffeomorphism or not. On the other hand, when we define the pull-back 
of a vector held in a later section we will only be able to do this if the map that 
we are using is a diffeomorphism. Push- forward is another matter. 

Definition 7.3 If <f> : N — > M is a C r diffeomorphism with r > 1. The push- 
forward of a function </>*/ by <p*f(p) := f(cj)^ 1 (p)). We can also define the 
push-forward of a 1-form as (f>*a = a o Tf -1 . 

It should be clear that the pull-back is the more natural of the two when it 
comes to forms and functions but in the case of vector fields this is not true. 

Lemma 7.1 The differential is natural with respect to pidlback. In other words, 
if (f> : N — > M is a C r map with r > 1 and f : M — > R a C r function with r > 1 
then d(<f>* f) = cjfdf. Consequently, differential is also natural with respect to 
restrictions 

Proof. Let v be a curve such that c(0) = v. Then 

d{4>* f)(y) = j t 4>* f(c(t)) = j f f{<j>(c(t))) 

= d/4 = dfiTf-v) 

dt 0 

As for the second statement (besides being obvious from local coordinate ex- 
pressions) notice that if U is open in M and t : U ^ M is the inclusion map 
(identity map idM ) restricted to U) then f\ v = t* f and df\ v = Cdf this part 
follows from the first part. ■ 

We also have the following familiar looking formula in the finite dimensional 
case 

df = 'S'' ^J—dx 1 

J ^ dx l 

which means that at each p £ U a 

dm = y dx i \ . 

y ’ A- 1 dx l 'p 
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In general, if we have a chart U, x then we may write 

7 , df 

dj = tt - dx 
ox 

We have seen this before. All that has happened is that p is allowed to vary so 
we have a field. 

For any open set U C M, the set of smooth functions defined C°°(U) on U 
is an algebra under the obvious linear structure (a/ + bg)(p ) := af(p ) + bg{p) 
and obvious multiplication; ( fg)(p ) := f(j>)g(p)- When we think of C°°(U) in 
this way we sometimes denote it by C°°(?7).The assignment U i— > X M (U) is a 
presheaf of modules over C°°. 

7.3 Frame Fields 

If U, x is a chart on a smooth n-manifold then writing x = (a; 1 , ...,x n ) we have 
vector fields defined on U by 

d d 

r : p i— > r 

dx 1 dx 1 p 

such that the together the ^ form a basis at each tangent space at point in 
U. We call the set of fields g^r, ..., g^ a holonomic frame held over U. If X is 
a vector held defined on some set including this local chart domain U then for 
some smooth functions X 1 dehned on U we have 

X(p) = £*'&>) A 


or in other words 

X\ v = Y X i -^—. 
u ' dx 1 

Notice also that dx 1 : p dx 1 1 defines a field of co- vectors such that 

'p 

dx l \ ..... dx n I forms a basis of T*M for each p E U. The fields form what is 
i p 1 1 'p p r 

called a holonomic 1 co-frame over U. In fact, the functions X 1 are given by 

dx l {X) : p i — ► dx l \ p (A p ). 


Notation 7.2 We will not usually bother to distinguish X from its restrictions 
and so we just write X = or using the Einstein summation convention 

x = x i y. 

OX 1 


It is important to realize that it is possible to have family of helds that 
are linearly independent at each point in their mutual domain and yet are not 
necessarily of the form g^ for any coordinate chart. This leads to the following 

1 The word holonomic comes from mechanics and just means that the frame field derives 
from a chart. A related fact is that \jjf- ■ fjj] = 0. 



108 


CHAPTER 7. VECTOR FIELDS AND 1-FORMS 


Definition 7.4 Let F\, F 2 , ..., F n be smooth vector fields defined on some open 
subset U of a smooth n-manifold M. If Fi(j>), F^Cp), F n (p) from a basis for 
T p M for each p £ U then we say that Fi, F 2 , ..., F n is a (non-holonomic) frame 
field over U . 


If F-[.F' 2 } F n is frame field over U C M and X is a vector field defined on 
U then we may write 

X = Y^ X ip i on U 

for some functions X z defined on U. Taking the dual basis in T*M for each 
p £ U we get a (non-holonomic) co- frame field F 1 , F" and then X 1 = F l (X). 

Definition 7.5 A derivation on C°°(U) is a linear map V : C°°(U) — » C°°(U) 
such that 

V(fg) = V(f)g + fV(g). 

A C°° vector field on U may be considered as a derivation on X(U) where 
we view X(U) as a module over the ring of smooth functions C°°(U). 

Definition 7.6 To a vector field X on U we associate the map Cx '■ Xm(U) — » 
%m(U) defined by 

(C x f)(p) :=X p -f 

and called the Lie derivative on functions. 

It is easy to see, based on the Leibnitz rule established for vectors X p in 
individual tangent spaces, that Cx is a derivation on C°°(U). We also define 
the symbolism “X/”, where X £ X(U), to be an abbreviation for the function 
Cx f ■ We often leave out parentheses and just write Xf(p) instead of the more 
careful ( Xf)(p ) and so, for example the derivation law (Leibnitz rule ) reads 

X(fg) = fXg + gXf. 


7.4 Lie Bracket 

Lemma 7.2 Let U C M be an open set. If Cx f(p) = 0 for all f £ C°°(U ) 
and all p £ U then X\ v = 0. 

Proof. Working locally in a chart (x, U), let X be the principal represen- 
tative of X (defined in section 7.1). Suppose that i : E — » R is a continu- 
ous linear map such that £(X(p)) ^ 0 (Hahn-Banach). Then D p t — I and 
d(x*J p )(X( P )) = D p FX^ 0 ■ 

Theorem 7.1 For every pair of vector fields X,Y £ X(M) there is a unique 
vector field [X, Y] which for any open set U C M and f £ C°°(U) we have 
[X, y]( p )/ = X p (Y f) — Y p (Xf) and such that in a local chart U,x the vector 
field [X, Y] has principal part given by 


DY ■ X - DX ■ Y 
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or more fully if f £ C°°(U) then letting f := / o x 1 be the local representative 
we have 

([X, Y]f)(x) = Df ■ (DY(x) • X(x) - DX(x) ■ Y(x)) 
where x ~ p and [X, Y]f means the local representative of [X,Y]f. 


Proof. We sketch the main points and let the reader fill in the details. 

One can prove that the formula / i— > Df(DY ■ X — DX ■ Y) defines a derivation 
locally. Also, DY ■ X — DX ■ Y transforms correctly and so gives a vector at 
each point. Thus by pulling back via charts we get a vector field on each chart 
domain. But these agree on overlaps because they are all coming from the 
local representations of a single derivation / i— > X p (Y f) — Y p (Xf). In the finite 
dimensional case we can see the result from another point of view. Namely, 
every derivation of C°° ( M ) is Cx for some X so we can define [X, Y] as the 
unique vector field such that £\x,y] = £x 0 ~ £y 0 C-x ■ ■ 

We ought to see what the local formula for the Lie derivative looks like in 
the finite dimensional case where we may employ classical notation. Suppose we 
have X = i X‘^ and Y = E™i Then [X, Y] = £. (E, - ^ Yl ) 


d 

dx z 


Exercise 7.1 Check this. 


Definition 7.7 The vector field [X, Y] from the previous theorem is called the 
Lie bracket of X and Y. 

The following properties of the Lie Bracket are checked by direct calculation. 
For any X, Y, Z G X(M), 

1. [X, Y\ = — [Y, X] 

2. [X, [X, Z]\+ [X, [Z, X]] + [X, [X, X]] = 0 

Definition 7.8 (Lie Algebra) A vector space a is called a Lie algebra if it is 

equipped with a bilinear map a x a — > a (a multiplication) denoted v, w i— > [u, w] 
such that 

[v, w] = — \w, v] 

and such that we have a Jacobi identity 


[x, [y, z]\ + [y, [z, *]] + [z, [x, y ]] = 0 


for all x,y,z £ a. 

We have seen that X(M) (or X(U)) is a Lie algebra with the Bracket defined 
in definition 7.7. 
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7.5 Localization 

It is a fact that if M is finite dimensional then every derivation of C°°(M) is 
given by some vector field and in fact the association X i— ► Lx is a bijection 
from X(M) onto the set of all derivations of the algebra of C°° = $(M) functions 
C°°(M). Actually, this is true not just for finite dimensional manifolds but also 
for manifolds modelled on a special class of Banach spaces which include Hilbert 
spaces. In order to get an isomorphism between the set of all derivations of the 
algebra C°°(M) = $(M) we need to be able to construct appropriate bump 
functions or more specifically cut-off functions. 

Definition 7.9 Let K be a closed subset of M contained in an open subset 
U C M. A cut-off function for the nested pair K C U is a C°° function 
0 : M — > R such that 0\ K = 1 and 0\ M \u = 0. 

Definition 7.10 A manifold M is said to admit cut-off functions if given 
any point p £ M and any open neighborhood U of p, there is another neighbor- 
hood V of p such that V C U and a cut-off function 0y v for the nested pair 
VcU. 

Lemma 7.3 All finite dimensional smooth manifolds admit cut-off functions. 
Proof. Exercise ■ 


Definition 7.11 Let E be a Banach space and suppose that the norm on E is 
smooth (resp.C r ). The we say that E is a smooth (resp. C r ) Banach space. 

Lemma 7.4 If E is a smooth (resp. C r ) Banach space and B r C Br nested 
open balls then there is a smooth (resp. C r ) function 0 defined on all of E which 
is identically equal to 1 on the closure B r and zero outside of Br. 


Proof. We assume with out loss of generality that the balls are centered at 
the origin 0 £ E. Let 


Ms) 


Hoc 9(t)dt 

IZo MM 


where 


M) 


exp(— 1/(1 — |t| 2 ) if \t\ < 1 
0 otherwise 


This is a smooth function and is zero if s < — 1 and 1 if s > 1 (verify). Now let 
0(x) = g( 2 — |x|). Check that this does the job using the fact that x i— > |x| is 
assumed to be smooth (resp. C r ). ■ 


Corollary 7.1 If a manifold M is modelled on a smooth (resp. C r ) Banach 
space E. (in particular, if M is an n-manifold), then for every a p £ T*M, there 
is a (global) smooth (resp. C r ) function f such that. Df | = a p . 
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Proof. Let Xo = ip (p) £ R" for some chart ip,U. Then the local represen- 
tative a X0 = (ip -1 )* a p can be considered a linear function on K n since we have 
the canonical identification R” = {xq}x R”= T Xo R”. Thus we can define 


^(x) 


/3(x)o X0 (x) for x£B R (x 0 ) 
0 otherwise 


and now making sure that R is small enough that B R (x o) C ip(U) we can 
transfer this function back to M via ip -1 and extend to zero outside of U get /. 
Now the differential of <p at Xo is a Xo and so we have for v £ T p M 

df{p) ■ v = d(ip*ip)(p) ■ v 
= ( ip*d<p)(p)v 
dp{T p ip ■ v ) 

= a Xo (T p ip ■ v ) = (ip -1 )* a p (Tpip ■ v ) 

= a p (Tip- l T p ip ■ v ) = a p (v) 


so df(p) =a p m 

Lemma 7.5 The map from X(M) to the vector space of derivations Der(M) 
given by X i— > C x is a module monomorphism if M is modelled on a C°° 
Banach space. 

Proof. The fact that the map is a module map is straightforward. We just 
need to get the injectivity. For that, suppose Cx / = 0 for all / £ C°°(M). 
Then Df\ p X p = 0 for all p £ M. Thus by corollary 7.1 a p (X p ) = 0 for all 
a p £ T*M. By the Hahn Banach theorem this means that X p = 0. Since p was 
arbitrary we concluded that X = 0. ■ 

Theorem 7.2 Let L : 3L(M) C°°(M) be a C°° (M)— linear function on vector 

fields. If M admits cut off functions then L(X)(p) depends only on the germ of 
X at p. 

If M is finite dimensional then L(X)(p) depends only on the value of X at 
P- 


Proof. Suppose X = 0 in a neighborhood U and let p £ U be arbitrary. 
Let O be a smaller open set containing with closure inside U. Then letting (3 
be a function that is 1 on a neighborhood of p contained in O and identically 
zero outside of O then (1 — fT)X = X. Thus we have 

L(X)(p) = L((l - p)X)(p) 

= (l-f)(p))L(X)(p)=OxL(X)(p) 

= 0 . 

Applying this to X — Y we see that if two fields X and Y agree in an open set 
then L(X) = L(Y) on the same open set. The result follows from this. 
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Now suppose that M is finite dimensional and suppose that X(p) = 0. 
Write X = X 1 in some chart domain containing p with smooth function X 1 
satisfying X l {p) = 0. Letting (3 be as above we have 

f3 2 L(X) = flX l L{f3^-) 

which evaluated at p gives 

L(X)(p) = 0 

since j3{p) = 1. Applying this to X — Y we see that if two fields X and Y agree 
at p then L(X)(p) = L(Y)(p). m 

Corollary 7.2 If M is finite dimensional and L : X(M) — > is a 

C°° (M) — linear function on vector fields then there exists an element a £ X*(M) 
such that a(X) = L(X) for all X £ X(M). 

Remark 7.1 (Convention) Even though a great deal of what we do does not 
depend on the existence of cut off functions there are several places where we 
would like to be able to localize. 


7.6 Action by pullback and push-forward 

Given a diffeomorphism <f> : M — > N we define the pull back <j>*Y £ X(M) 
for Y £ X(N) and the push-forward (j>*X £ X(N) of X £ X(M) via <j> by 
defining 

(f>*Y = oYofi and 
(t>*X = T<j>o lof 1 . 

In other words, ((f>*Y)(p) = Tfi -1 ■ Y^p and {<f> if X)(p) = T(f> ■ X^-i^ . Notice 
that (f>*Y and <j>*X are both smooth vector fields. Let (f> : M — > N be a smooth 
map of manifolds. The following commutative diagrams summarize some of the 
concepts: 

T p M T ^t T^pN 

I I 

M,p N,<j)(p) 

TM ^ TN 

I I 

M -t N 

and if <f> is a diffeomorphism then 

X{M) h X(N) 

i i 


M 


N 
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and also 

X(M) £ X(N) 

I I 

M ■£ N 

Notice the arrow reversal. The association (f> i— > </>* is said to be contravariant 
as opposed to <j> i— > which is covariant. In fact we have the following facts 

concerning a composition of smooth diffeomorphisms M £ N £ P : 

(</> 0 /)* = <t>* ° f* covariant 

{4> o /)* = f* o <fi* contravariant 

If M = N, this gives a right and left pair of actions 2 of the diffeomorphism 
group Diff(M) on the space of vector fields: X(M) = T(M,TM). 

Diff(M) x X(M) -► X(M) 

x) i ► (/>*x 

and 


X(M) x Diff(M) -► X(M) 

(X,4>) ^(j>*X 

Now even if (f> : M — > N is not a diffeomorphism it still may be that there is 
a vector field Y £ X(N) such that 

T<j) o X = Y o (j>. 


Or on other words, T(j> ■ X p = Y^p, for all p in M. In this case we say that Y 
is ^-related to X and write X Y . 

Theorem 7.3 The Lie derivative on functions is natural with respect to pull- 
back and push-forward by diffeomorphisms. In other words, if <j> : M — > IV is a 
diffeomorphism and f £ C°°(M), g £ C°°(N), X £ X(M) and Y £ X(N) then 

£(f,*Y<P* g = <t>* Cyg 


and 


Proof. 


^■4>,x4>*f = <t>*£xf 


(C rY (f*g){p) = d(<j>*g)(<t>*Y){p) 

= (0*d< 7 )(T0- 1 Y(#)) = d 5 (T^- 1 Y(#)) 

= dg(Y(<fyp)) = {<j>* C Y g)(jp) 

The second statement follows from the first since (</> -1 )* = </>*. ■ 

In case the map (f> : M — > TV is not a diffeomorphism we still have a result 
when two vector fields are ^-related. 

2 For the definition of group action see the sections on Lie groups below. 
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Theorem 7.4 Let <f> : M — > N be a smooth map and suppose that X Y. 
Then we have for any g G C°°(N) Cx4>*g — <t>* C yQ- 

The proof is similar to the previous theorem and is left to the reader. 


7.7 Flows and Vector Fields 


All flows of vector fields near points where the field doesn’t vanish 
look the same. 

A family of diffeomorphisms 4> t : M — > M is called a (global) 
flow if t i— > is a group homomorphism from the additive group 

R. to the diffeomorphism group of M and such that <f> t (x) = 
gives a smooth map IxM -» M. A local flow is defined similarly 
except that 4>(f, x) may not be defined on all of Rx M but rather on 
some open set in RxM and so we explicitly require that 


1. o $ s = 4> t+s and 

2 . Qf 1 = 


for all t and s such that both sides of these equations are defined. 
Using a smooth local flow we can define a vector field A $ by 


**(p) 


d 

dt 


&(t,p) G T p M. 
o 


If one computes the velocity vector c(0) of the curve c : t i— ► a:) one gets 

X®(x). On the other hand, if we are given a smooth vector field X in open 
set U C M then we say that c : (a, b) — > M is an integral curve for X if 
c(t) = X(c(t)) for t G (a, b). 

Our study begins with a quick recounting of a basic existence and uniqueness 
theorem for differential equations stated here in the setting of Banach spaces. 
The proof may be found in Appendix B. 


Theorem 7.5 Let E be a Banach space and let X : U C E — ► E be a smooth 
map. Given any Xq G U there is a smooth curve c : (— e, e) — * U with c(0) = Xq 
such that c'(t) = X(c(t)) for all t G (— e,e). If C\ : (— ei,ei) —> U is another 
such curve with Ci(0) = Xq and c^ft) = X(c(t)) for all t G (— ei,ei) then 
c = Ci on the intersection (— ei,ei) D (— e, e). Furthermore, there is an open 
set V with Xq G V C U and a smooth map $ : V x (—a, a) — » U such that 
1 1 — > Ca;(t) := 4>(x,<) is a curve satisfying c' (t) = X(c(t)) for all t G (—a, a). 

We will now use this theorem to obtain similar but more global results on 
smooth manifolds. First of all we can get a more global version of uniqueness: 

Lemma 7.6 If c\ : (— ei,ei) — > M and C 2 : (— £ 2 , £ 2 ) — ► M are integral curves 
of a vector field X with Ci( 0 ) = 02(0) then C\ = C2 on the intersection of their 
domains. 
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Proof. Let K = {t. £ (— ei,ei) fl (—€ 2 , 62 ) : c\{t) = C 2 (t)}. The set K 
is closed since M is Hausdorff. If follows from the local theorem 7.5 that K 
contains a (small) open interval (— e, e). Now let to be any point in K and 
consider the translated curves c*°(t) = ci(to + t ) and c^it) = 02(^0 + t). These 
are also integral curves of X and agree at t = 0 and by 7.5 again we see 
that c*° = 0*2 on some open neighborhood of 0. But this means that ci and 
C 2 agree in this neighborhood so in fact this neighborhood is contained in K 
implying K is also open since to was an arbitrary point in K. Thus, since 
/ = (—ei, ei) D (— 62 , 62 ) is connected, it must be that I = K and so Ci and C 2 
agree on /= (-ei,ei) l~l (-e 2 ,e 2 ). ■ 


Flow box and Straightening 

Let X be a C r vector held on M with r > 1. A flow box for A at a point 
Po £ M is a triple ([/, a, F1‘ Y ) where 

1. U is an open set in M containing p. 

2. F1 a : U x (—a, a) — > M is a C r map and 0 < a < 00 . 

3. For each p £ M the curve f 1 — > c p (t) = Fl x (p,t) is an integral curve of X 
with c p (0) = p. 

4. The map F1 A : U — > M given by F1 A (p) = F1 A (p, t ) is a diffeomorphism 
onto its image for all t £ (—a, a). 

Now before we prove that flow boxes actually exist, we make the following 
observation: If we have a triple that satisfies 1-3 above then both Ci : t 1 — > 
Fl ^ +s (p) and C 2 : 1 1 — > Fl A (Fl A '(p)) are integral curves of X with ci(0) = 02 ( 0 ) = 
Fl A (p) so by uniqueness (Lemma 7.6) we conclude that Fl A (Fl A '(p)) = F1) A s (p) 
as long as they are defined. This also shows that 

Flf oFlf = Fl A s = Flf oFlf 
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whenever defined. This is the local group property, so called because if F1 A 
were defined for all i G R (and X a global vector field) then t i— > F1 A would 
be a group homomorphism from R into Diff(M). Whenever this happens we 
say that X is a complete vector field. The group property also implies that 
F1 A o Fl A t = id and so F1 A must at least be a locally defined diffeomorphism 
with inverse Fl A t . 

Theorem 7.6 (Flow Box) Let X be a C r vector field on M with r > 1. Then 
for every point Pq G M there exists a flow box for X atp^. If (U\, ai, F1 A ) and 
(t/ 2 ,a 2 ,Fl^) are two flow boxes for X at pofihen F1 A = F1 A on (— ai,ai) fl 
(—a 2 ) (* 2 ) x U\C\U 2 - 

Proof. First of all notice that the U in the triple (U, a, F1 A ) does not have 
to be contained in a chart or even homeomorphic to an open set in the model 
space. However, to prove that there are flow boxes at any point we can work in 
the domain of a chart U a , fi> a and so we might as well assume that the vector 
field is defined on an open set in the model space as in 7.5. Of course, we may 
have to choose a to be smaller so that the flow stays within the range of the 
chart map if a . Now a vector field in this setting can be taken to be a map 
U — * E so the theorem 7.5 provides us with the flow box data (V, a,<f>) where 
we have taken a > 0 small enough that V t = *!>(£, F) C U a for all t G (—a, a). 
Now the flow box is transferred back to the manifold via ifa 

U = if~\V) 

F1 A (t,p) = <S>(t,if a (p)). 

Now if we have two such flow boxes (17i,ai,Fl A ) and (172,a2,Fl A ) then by 
lemma 7.6 we have for any x G U\ fl U 2 we must have F1 A (f, x ) = Fl A '(t, x) for 
all t G (— ai, afi) fl (— 02 , 02 ). 

Finally, since both F1 A = Fl A (f, .) and Fl A t = F1 A (— t , .) are both smooth 
and inverse of each other we see that F1 A is a diffeomorphism onto its image 


Definition 7.12 Let X be a C r vector field on AI with r > 1. For any given 
p G AI let {T~ x ,Tp x ) CR be the largest interval such that there is an integral 
curve c : iT~ x ,T^ x ) ~ * M of X with c(0) = p. The maximal flow F1 A is 
defined on the open set (called the maximal flow domain) 


^V X = \J(T~ X ,T+ X )x{p}. 

Remark 7.2 Up until now we have used the notation F1 A ambiguously to refer 
to any (local or global) flow of X and now we have used the same notation for 
the unique maximal flow defined on XV X - We could have introduced notation 
such as Fl A ax but prefer not to clutter up the notation to that extent unless 
necessary. We hope that the reader will be able to tell from context what we are 
referring to when we write F1 A . 
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Exercise 7.2 Show that what we have proved so far implies that the maximal 
interval {T~ X ,T^ X ) exists for all p £ M and prove that TV x is an open subset 
of R x M. 

Definition 7.13 We say that X is a complete vector field iff TV x = R x M . 

Notice that we always have F1 A o F1 Y ' = Fl^ s = F1 Y " o Flf whenever s, t 
are such that all these maps are defined but if X is a complete vector field then 
this equation is true for all s,t £ R. 

Definition 7.14 The support of a vector field X is the closure of the set {p : 
X(p) ^ 0} and is denoted supp(X). 

Lemma 7.7 Every vector field that has compact support is a complete vector 
field. In particular if M is compact then every vector field is complete. 

Proof. Let c x be the maximal integral curve through p and (T~ x , T+ x ) 
its domain, ft is clear that for any t € (T^ X ,T+ X ) the image point c x (t) must 
always lie in the support of X. But we show that if T^ x < oo then given any 
compact set K C M, for example the support of X, there is an e > 0 such that 
for all t £ {Tp X ~ e >^p" y) the i ma g e c Y (f) is outside K. If not then we may 
take a sequence t* converging to T+ x such that c A (ij) £ K. But then going to 
a subsequence if necessary we have x* := c x {tf) — > x £ K. Now there must be 
a flow box (U, a, x ) so for large enough k, we have that tk is within a of Iff x 
and Xi = Cp(ti) is inside U. We then a guaranteed to have an integral curve 
c x .(t) of X that continues beyond T+ x and thus can be used to extend c x a 
contradiction of the maximality of Tp X . Hence we must have Tp X = oo. A 
similar argument give the result that Tff x = — oo. ■ 

Exercise 7.3 Let a > 0 be any positive real number. Show that if for a given 
vector field X the flow F1 A " is defined on (— a , a) x M then in fact the (maximal) 
flow is defined onl x M and so X is a complete vector field. 


7.8 Lie Derivative 


Let A be a vector field on M and let F1‘ Y (p, t ) = F1 Y (■ t ) = F1 A (p) be the flow 
so that 


d 

dt 


FI 


x, 


■ P (.t)=T 0 F1 Y - 


= X„ 


Recall our definition of the Lie derivative of a function (7.6). The following 
is an alternative definition. 


Definition 7.15 For a smooth function f : M — > R. and a smooth vector field 
X £ X(M) define the Lie derivative Cx of f with respect to X by 


Cx :/(p) 


d_ 

dt 


f oFl X (p,t) 


0 
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Exercise 7.4 Show that this definition is compatible with definition 7.6. 

Discussion: Notice that if X is a complete vector field then F1 Y is a dif- 
feomorphism M — » M and we may define (Fl Y '*Y)(p) = (T p Fl^') _1 Y(Flt(p)) 
or 

Flf*Y = (TFlf) -1 o YoFl t . (7.1) 

On the other hand, if X is not complete then we cannot say that Flj Y is a 
diffeomorphism of M since for any specific t there might be points for which 
Flj Y is not even defined! To see this one just needs to realize that the notation 
implies that we have a map t i— > F1 Y " £ Diff(M). But what is the domain? 
Suppose that e is in the domain. Then it follows that for all 0 < t < e the 
map Flj Y is defined on all of M and Fl^ (p) exists for 0 < t < e independent 
of p. But now a standard argument shows that t i— > F1 Y (p) is defined for all t 
which means that X is a complete vector held. If X is not complete we really 
have no business writing down the above equation without some qualification. 
Despite this it has become common to write this expression anyway especially 
when we are taking a derivative with respect to t. Whether or not this is just a 
mistake or liberal use of notation is not clear. Here is what we can say. Given 
any relatively compact open set U C M the map F1 Y will be defined at least for 
all t £ (—£,£■) for some £ depending only on X and the choice of U. Because of 
this, the expression Flj Y *Y = (T p Fl Y )~ o Y oFl* is a well defined map on U for 
all t £ (— £, e). Now if our manifold has a cover by relatively compact open sets 
M = (J Ui then we can make sense of F1^*Y on as large of a relatively compact 
set we like as long as t is small enough. Furthermore, if Flj Y *Y and F4 y *Y 

are both defined for the same t then they both restrict to F1?*Y . So 

UiDUj 

Fi^*y makes sense point by point for small enough t. It is just that “small 
enough” may never be uniformly small enough over M so literally speaking F1 j Y 
is just not a map on M since t has to be some number and for a vector field 
that is not complete, no t will be small enough (see the exercise 7.3 above). At 
any rate t i— > (Fl^*Y)(p) has a well defined germ at t = 0. With this in mind 
the following definition makes sense even for vector fields that are not complete 
as long as we take a loose interpretation of Fl f Y *Y . 

Definition 7.16 Let X and Y be smooth vector fields on M. Then the Lie 
derivative L X Y defined by L X Y:= | Q F1**Y. 

The following slightly different but equivalent definition is perfectly precise 
in any case. 

Definition 7.17 (version2 ) Let X and Y be smooth vector fields on M or 
some open subset of M. Then the Lie derivative L X Y defined by 

TF1* • Y f] x 0d) - Y 

L x Y(p) = Jim (P) (7.2) 


for any p in the domain of X. 
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Figure 7.1: Lie derivative defined by flow. 
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In the sequel we will sometimes need to keep in mind our comments above 
in order to make sense of certain expressions. 

If X, Y £ X(M) where M is modelled on a C°° Banach space then we have 
defined [X, Y] via the correspondence between vector fields and derivations on 
C°°(M) = $(M). Even in the general case [X, Y] makes sense as a derivation 
and if we already know that there is a vector held that acts as a derivation in 
the same way as [ X , Y ] then it is unique by 7.5. 

Theorem 7.7 If X,Y £ X(M) and f £ C°°(M) then (. L x Y)f = [X,Y]f. 
That is LxY and [X, Y] are equal as derivations. Thus LxY = [X, Y] as 
vector fields. In other words, LxY is the unique vector field such that 

( L X Y)(p ) = X p (Yf) - Y p (Xf) 

Proof. We shall show that both sides of the equation act in the same way as 
derivations. Consider the map a : I x I — > ffi. given by a(s, t) = Y (F1 A (p, s))(f o 
FI*). Notice that a(s, 0) = F(Fl A '(p, s))(f) and a(0,t) = Y(p)(f oFl A ) so that 
we have 

^ a(0 ’ 0 )= ^s\ o = ^s\ 0 YfO Fl * {P ' S) = XpYf 

and similarly 

^a(0,0 ) = Y p Xf. 

Subtracting we get [X, Y](p). On the other hand we also have that 

§i a{ °' 0) - 1 “ (0 ' 0) = i [ a{r ’ _r) = (LxY) - 

so we are done. To prove the last statement we just use lemma 7.5. ■ 


Theorem 7.8 Let X, Y £ X(M ) 

jL\f*Y = Flf*(L x yj 

Proof. 


d_ 

dt 



d_ 

ds 


Flf*(Flf*y) 

0 



(Fif*y) 

o 


= Flf ■ *L X Y 



7.8. LIE DERIVATIVE 


121 


Now we can see that the infinitesimal version of the action 


r (M,TM) x Diff(M) -► r(M, TM) 
(X, <F) i — ^ $*X 


is just the Lie derivative. As for the left action of Diff (M) on X(M) = T (M, TM) 
we have for X,Y £ X{M) 


d 

dt 


(FI lY){p) 
o 


d 

dt 


TFl t Y (F(Fl t - 1 (p))) 

o 


d_ 

dt 


(TFl5 t )- 1 y(Fl_ t (p)) 


-(L x Y) = -[X,Y\ 


It is easy to see that the Lie derivative is linear in both variables and over the 
reals. ■ 


Proposition 7.1 Let X £ X(M) and Y £ X(N) be (j)-related vector fields for 
a smooth map <j> : M — > N. Then 

4»oFlf = Flfo <j> 

whenever both sides are defined. Suppose that <f> : M — > M is a diffeomorphism 
and X £ X(M). Then the flow of <p*X = is <foF\^ o <p~^ and the flow 

of (jfX is cf)- 1 o Flj Y o <f>. 

Proof. Differentiating we have fjf((f> oFl Y ) = T(j> o j|Fl Y =TfioX oFlf" = 
Y o (f o F1 Y . But <f> o F1 Y (a;) = (f>(x) and so 1 1 — > <f> o F1 Y (a;) is an integral curve 
of Y starting at <j>(x). By uniqueness we have 0oFl Y (a:) = Fl^ (<j)(x)). ■ 

Lemma 7.8 Suppose that X £ X(M) and X £ X(N) and that <j> : M — > N is a 
smooth map. Then X X iff 

Xfo<j>= X(f o<j>) 

for all f £ C°°{U) and all open sets U C N. 

Proof. We have {X f o <j>)(p) = df^^X^fp)). Using the chain rule we have 

X{f°4>) ( P ) = d(f o <t>)(p)X p 
= df^,(p)T p (j) • Xp 

and so if X X then T(j> o X = X o and so we get Xf o <j> = X(f o fi). 
On the other hand, if Xf o (j) = X(f o fi) for all / £ C°°(U) and all open sets 
U C N then we can pick U to be a chart domain and / the pull back of a 
linear functional on the model space K™. So assuming that we are actually on 
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an open set U C R" and / = a is any functional from (R")* we would have 
Xa o (j>(p) = X(a o tf>) or da(X^, p ) = d(a o </>)X p or again 

do: (pp ( X j — doc^piTcj) * Xp ) 

<pp( X(pp ) = a,pp(T(f) • Xp ) 

so by the Hahn-Banach theorem X$ p = T(f> ■ X p . Thus since p was arbitrary 

■ 

Theorem 7.9 Let <j) : M —> N be a smooth map, 1,7 £ X(M), 1,7 £ X(N) 
and suppose that X X and Y Y . Then 

[X,Y] [X,Y] 

In particular, if (j> is a diffeomorphism then [</>*X, 4>*Y] = />*[X, Y]. 

Proof. By lemma 7.8 we just need to show that for any open set U C N 
and any / G C°°(U) we have ([X, Y]f) o cj> = [X, Y](f o <j>). We calculate using 
7.8: 


([X, Y}f) o = X[Y /) o 0 - Y(Xf) o </> 
= X{{Yf)o4>)-Y{{Xf)o4>) 
= X(Y(f O 4,)) - Y (X(/ o </)) = [X, X] o </. 


Theorem 7.10 For X,Y G X(M) eac/i of the following are equivalent: 

1. C X Y = 0 

,g. (Fif)*y = y 

5. T/ie /lows of X and Y commute: 

Flj v o Fig = Flf o Fl^ whenever defined. 

Proof. The equivalence of 1 and 2 is follows easily from the proceeding 
results and is left to the reader. The equivalence of 2 and 3 can be seen by 
noticing that Flj V o Fig = Fig o F1 j Y is true and defined exactly when Fig = 
Flf Y o Fig o Flj Y which happens exactly when 

Fir = Fli F1 ">* F 

and in turn exactly when Y = (Flj Y )*y. ■ 

Proposition 7.2 [fX,gY] = fg[X,Y] + f(Xg)Y - g(Y f)X . 
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This is a straightforward calculation. 

The Lie derivative and the Lie bracket are essentially the same object and 
are defined in for local sections X € Xm(U) as well as global sections. As is so 
often the case for operators in differential geometry, the Lie derivative is natural 
with respect to restriction so we have the commutative diagram 

X M (U) L ^ u X m (U) 

Ty I I r" 

X M (V) L ^ v X m (V) 

where Xu = X\ v denotes the restriction of X £ Xm to the open set U and ry 
is the map that restricts from U to V C U. 

7.9 Time Dependent Fields 

Consider a small charged particle pushed along by the field set up by a large sta- 
tionary charged body. The particle will follow integral curves of the field. What 
if while the particle is being pushed along, the larger charged body responsible 
for the field is put into motion? The particle must still have a definite trajectory 
but now the field it time dependent. To see what difference this makes, consider 
a time dependent vector field X(t,.) on a manifold M. For each x £ M let 
<j>t( x) be the point at which a particle that was at x at time 0,ends up after 
time t. Will it be true that <p s o (j>t{x) = <f s +t{x)I The answer is the in general 
this equality does not hold. The flow of a time dependent vector field is not a 
1-parameter group. On the other hand, if we define $ Sjt (a;) to be the location 
of the particle which was at x at time s at the later time t then we expect 

®s,r O $ r , t = $ S)t 

which is called the Chapman-Kolmogorov law. If in a special case ( h r i depends 
only on s — t then setting fit := <I>o.t we recover a flow corresponding to a 
time-independent vector held. The formal definitions are as follows: 

Definition 7.18 A C r time dependent vector field on M is a C r map X : 

(a, b) x M — * TM such that for each fixed f£(a,fi)ci the map X t : M — > TM 
given by X t (x) := X(t,x) is a C r vector field. 

Definition 7.19 Let X be a time dependent vector field. A curve c : (a, b) — > M 
is called an integral curve of X iff 

c(t) = X(t, c(t )) for all t £ (a, b). 

The evolution operator for X is defined by the requirement that 

and $* s (x) = x. 

In other words, t i— > ^f s (x) is the integral curve which goes through x at time 
s. 
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We have chosen to use the term “evolution operator” as opposed to “flow” 
in order to emphasize that the local group property does not hold in general. 
Instead we have the following 

Theorem 7.11 Let X be a time dependent vector field. Suppose that X t £ 
X r (M) for each t and that X : ( a,b ) x M — > TM is continuous. Then is 
C r and we have 4> s>0 o 4> a t = 4> s t whenever defined. 

Theorem 7.12 Let X and Y be smooth time dependent vector fields and let 
f : KxM — > K. be smooth. We have the following formulas: 

h = {*&*{% + x t f t ) 

where ft() := /(£, .), and 

d r)Y 



Chapter 8 

Lie Groups II 


Definition 8.1 For a Lie group G and a fixed g £ G, the maps L g :G — > G 
and R g : G — > G are defined by 

L g x = gx for x £ G 
R g x = xg for x £ G 

and are called left translation and right translation respectively. 

Definition 8.2 A vector field X £ X(G ) is called left invariant iff (L g ) t X = 
X for all g £ G. A vector field X £ X(G) is called right invariant iff 
(R g )*X = X for all g £ G. The set of left invariant (resp. right invariant) 
vectors Fields is denoted £(G) orX L {G) (resp. 91(G) or X R (G)). 

Note that X £ X(G) is left invariant iff the following diagram commutes 

TG T -^ 9 TG 

I I 

G L -A G 

for every g = g £ G. There is a similar diagram for right invariance. 

Lemma 8.1 X L (G) is closed under the Lie Bracket operation. 

Proof. Suppose that X, Y £ X L {G). Then by 7.9 we have 

(L g )4X,Y\ = C X Y = C Lg , x L g *Y 
= [L g »X, L g *Y] = [X,Y] 


Corollary 8.1 X L (G) is an n- dimensional Lie algebra under the Bracket of 
vector fields (see definition ?? ). 
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Lemma 8.2 Given a vector v € T e G we can define a smooth left- (resp. right) 
invariant vector field L v (resp. R v ) such that L v (e ) = v (resp. R v (e ) = v) by 
the simple prescription 


L v (g ) = TL g • v 
R v {g) = ( resp . TR g -v) 

and furthermore the map v L v is a linear isomorphism from T e G onto X L (G). 

Proof. The proof that this gives the invariant vector fields as prescribed is 
easy and left as an exercise. ■ 

Definition 8.3 To every v £ T e G we associate a left ( resp. right ) invariant 
vector field via the map 


L : v i— > L v (resp. R : v i— > R v ). 

Now we can transfer the Lie algebra structure to T e G by defining a bracket 
operation on T e G by using the bracket of the corresponding left invariant vector 
fields. 

Definition 8.4 For a Lie group G , define the bracket of any two elements v,w £ 
T e G by 

M := [L v ,L w ](e). 

The Lie algebras thus defined are isomorphic by construction both of them 
are often referred to as the Lie algebra of the Lie group G and denoted by £ie(G) 
or g. Of course we are implying that £i e(H) is denoted f) and £i e(K) by 6 etc. 
Our specific convention will be that g = £ie(G) := T e G with the bracket defined 
above and then occasionally identify this with the left invariant fields X L (G) 
with the vector field Lie bracket defined in definition ??. 

Let us compute the form of the Lie bracket for the Lie algebra of the matrix 
general linear group. First of all this Lie algebra is T/GL(n) which is canonically 
isomorphic to the vector space of all matrices M„ x „ so we set 

0[(?r) = M„ xn . 

Now a global coordinate system for GL(n) is given by the maps 

x k i : (a)) i ^ af. 

Thus any vector fields X, Y £ X(GL(n)) can be written 
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and hence for some functions f‘ and g). Under the canonical isomorphism we 
have two matrix valued functions F = (f)) and G = ( g ) ) . The bracket is then 


d 


\X Y] = p d9 U “ 

’ 3 dx) 3 dx) ' dx * 


Now the left invariant vector fields are given by g i— > gA for constant matrix A. 
In coordinates, left invariant vector fields corresponding to A, B £ g[(?i) would 
be 

X = xi a) 7 ^ and 
k 3 dx) 

Y = x k^Sj 
k 3 dx) 


and then the bracket 

[X,Y] = 


.5(4 b 1 )) _ br d(x i k a))' 


dxl 


dxl 


dx) 


= ( 44 - 44)^7 


which corresponds to the matrix AB — BA. So we have 


Proposition 8.1 Under the identification of XjGL(n) with M nxn the bracket 
is the commutator bracket 


[A, B}= AB- BA 

Similarly, under the identification of T^GlfV) with End(V) the bracket is 

[A,B}= AoB-BoA. 

Now we can use the maps TL g -i and TR g -i to identify vectors in T g G with 
unique vectors in T e G = g: Define the maps wq : TG — » g (resp. u)Q 9ht : 
TG — ► g) by 


w G (X g ) = TL g -i ■ X g 
(resp. w r J 9ht (X g ) = TR g -i ■ X g ) 

wq is a g valued 1-form called the (left-) Maurer Cartan form. We will call 
u>Q 9ht the right-Maurer Cartan form but we will not be using it to the extent of 
lug- Now we can define maps triv^ : TG — > G x g and triv^ : TG — > G x g by 


triv L (u s ) = (g,u G {v g )) 
triv H (u g ) = (g,cue 9ht (v g )) 

for v g £ T g G. These maps are both vector bundle isomorphisms. Thus we have 
the following: 
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Proposition 8.2 The tangent bundle of a Lie group is trivial: TG = Gxg. 

Proof. Follows from the above discussion. ■ 

Lemma 8.3 (Left-right lemma) For any v £ 0 the map g i— > triv£ 1 (g,u) is 
a left invariant vector field on G while g i— > triv) ? 1 (g, v) is right invariant. Also, 
triv r o triv £ 1 (g, u) = (g, Ad g (v)). 

Proof. The invariance is easy to check and is left as an exercise. Now the 
second statement is also easy: 

trivfl o triv£ 1 (n) 

= (. g,TR g -iTL g v ) = (g,T(R g -iL g ) ■ v) 

= (g,Ad g (v)). 


Example 8.1 Consider the group SU(2) defined above. Suppose that g(t) is a 
curve in SU( 2) with g( 0) = e = I. Then g(t)g(t) = I and so 


o=| 

dt 




= g'(0) +g'(0) 


so g'{ 0) is skew-Hermitian: g'(0) = — </(0). Thus every element of T e SU (2) = 
su(2) is a skew-Hermitian matrix. But we also have the restriction that det(g(t)) = 
1 and this means that g'{ 0) also has to have trace zero. In fact, if A is any skew- 
Hermitian matrix with trace zero then g(t) = e tA is a curve with g'( 0) = A so 
the Lie algebra su(2) is the set vector space of skew-Hermitian matrices with 
trace zero. We can also think of the Lie algebra of SU (2) as Hermitian matri- 
ces of trace zero since A < — * i.A is an isomorphism between the skew-Hermitian 
matrices and the Hermitian matrices. 


Proposition 8.3 Given a Lie group homomorphism h : G\ — » G 2 we have 
that the map T e h : 0 i — > 02 is a Lie algebra homomorphism called the Lie 
differential which is often denoted in this context by dh : g 1 — > g 2 or by 
Lh : 0 i — > 0 2 . 

Proof. For v £ 0i and x £ G we have 

T x h ■ L v (x) = T x h ■ ( T e L x ■ v) 

= T e (h o L x ) ■ v 

= T e{L h{x) o h) ■ v 

= T e (L h ( x )) oT e h ■ v 
= L dh ^ v \h{ x)) 
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so L v L dh ( v \ Thus by 7.9 we have for any v,w £ 0i that L^ v,w ^ ^ h 
[ L dh(v) L dh{w)\ or in other worc j S) [L dh (v) ; L d Kw)] oh = Tho L^ which at e 


gives 


[dh(v), dh(w)) = [«,«;]. 


Theorem 8.1 Let invariant vector fields are complete. The integral curves 
through the identify element are the one-parameter subgroups. 

Proof. Let X be a left invariant vector field and c : (a, b) — > G be the integral 
curve of X with c(0) = X(p). Let a < t\ < t 2 < b and choose an element g £ G 
such that gc(ti) = c(f 2 ). Let At = f 2 — t\ and define c : (a + At, b + At) — * G 
by c(t) = gc(t — At). Then we have 

c(t) = TL g • c {t — At) = TL g ■ X(c(t - At)) 

= X(gc(t - At)) = X(c(t )) 

and so c is also an integral curve of X. Now on the intersection (a + At, b) of 
the their domains, c and c are equal since they are both integral curve of the 
same held and since c(f 2 ) = c/c(ti) = c(t 2 ). Thus we can concatenate the curves 
to get a new integral curve defined on the larger domain (a, b + At). Since this 
extension can be done again for a fixed At we see that c can be extended to 
(a, 00 ). A similar argument gives that we can extend in the negative direction 
to get the needed extension of c to (— 00 , 00 ). 

Next assume that c is the integral curve with c(0) = e. The proof that 
c(s + t) = c(s)c(t) proceeds by considering 7(f) = c(s) _1 c(s + 1). Then 7(0) = e 
and also 

7 '(t) = TL c ( s) -i ■ c'(s + t) = TL c{s) - 1 • X(c(s + t)) 

= X(c(s)~ x c(s + t)) = X(j(t)). 

By the uniqueness of integral curves we must have c(s)^ 1 c(s + 1) = c(t) which 
implies the result. Conversely, if c : R — > G is a one parameter subgroup the 
let X e = c(0) then there is a left invariant vector held X such that X(e) = X e . 
We must show that the integral curve through e of the held X is exactly c. 
But for this we only need that c(t) = X(c(t)) for all t. Now c(t + s) = c(s) or 
c(t + s) = L c (t)c(s). Thus 

C(t+s) = (T c(t) L).c(0) = X(c(t)) 

0 

and we are done. ■ 

Lemma 8.4 Let v £ g = T e G and the corresponding left invariant field L v . 
Then with Fl v := Fl L we have that 



Fl v (st) = Fl sv (t) 
A similar statement holds with R v replacing L v . 


(8.1) 
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Proof. Let u = st. We have that £\ t=0 FF(st) = % £| t=0 ^»f = su 
and so by uniqueness Fl v (st,) = Fl sv (t). m 

Definition 8.5 For any = T e G we have the corresponding left invariant 
field L v which has an integral curve through e which we denote by exp(tu). Thus 
the map t — » exp(fu) is a Lie group homomorphism from R. into G which is a 
one-parameter subgroup. The map v i— > exp(lu) = exp G (u) is referred to as the 

exponential map exp G : g — > G. 

Lemma 8.5 exp G : g — > G is smooth. 

Proof. Consider the map IxGxg-+Gxg given by (t, g, v) i— > (g ■ 
exp G (tv),v). This map is easily seen to be the flow on G x g of the vector field 
X : (g,v) i— > (L v (g), 0) and so is smooth. Now the restriction of this smooth 
flow to the submanifold { 1 } x {e} x 0 is (l,e, v) e- > (exp G (u),u) is also smooth 
which clearly implies that exp G (u) is smooth also. ■ 


Theorem 8.2 The tangent map of the exponential map exp G : g G is the 
identity at the origin 0 € T e G = g and exp is a diffeomorphism of some neigh- 
borhood of the origin onto its image in G. 

Proof. By lemma 8.5 we know that exp G : g — » G is a smooth map. Also, 
| 0 exp G (fu) = v which means the tangent map is v h- * v. If the reader thinks 
through the definitions carefully, he or she will discover that we have here used 
the natural identification of g with T 0 g. ■ 

Remark 8.1 The “one-parameter subgroup” exp G (tv) corresponding to a vec- 
tor v £ g is actually a homomorphism rather than a subgroup but the terminology 
is conventional. 


Proposition 8.4 For a 

diagram commutes: 


(Lie group) homomorphism h : Gi — * G^ the following 
dh 

01 — > 02 

exp Gl | exp Ga i 

Gi A G 2 


Proof. The curve t, i— > /i(exp Gl (tv)) is clearly a one parameter subgroup. 
Also, 


d_ 

dt. 


h(exp Gl (tv)) = dh(v) 


so by uniqueness of integral curves h(exp Gl (tv)) = exp Gs (tdh(v)). 


Remark 8.2 We will sometimes not index the maps and shall just write exp 
for any Lie group. 
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The reader may wonder what happened to right invariant vector fields and 
how do they relate to one parameter subgroups. The following theorem give 
various relationships. 


Theorem 8.3 For a smooth curve c : K. — » G with c(0) = e and c(0) = v, the 
following are all equivalent: 

1. c{t) = Flf v (e) 

2. c{t) = Flf (e) 

3. c is a one parameter subgroup. 

4- Fit = Rc( t ) 

5. Flf = L c{t) 


Proof. By definition Flf (e) = exp (to). We have already shown that 1 
implies 3. The proof of 2 implies 3 would be analogous. We have also already 
shown that 3 implies 1. 

Also, 4 implies 1 since then Flf z (e) = R c ^(e) = c(t). Now assuming 1 we 
have 


c(t)=Flf(e ) 


d_ 

dt 


c(t ) = L v (e) 


d_ 

dt 


sc{t) =dt 


Lg( C {t)) 


= TLgV = L v (g) for any g 


d_ 

dt 


o 




L v (g) for any g 
R c {t) = Fit 


The rest is left to the reader. ■ 

The tangent space at the identity of a Lie group G is a vector space and is 
hence a manifold. Thus exp is a smooth map between manifolds. As is usual 
we identify the tangent space T v (T e G) at some v £ T e G with T e G itself. The 
we have the following 

Lemma 8.6 T e exp = id : T e G —> T e G 

Proof. T e exp-u= | Q exp(to) = v. m 

The Lie algebra of a Lie group and the group itself are closely related in 
many ways. One observation is the following: 

Proposition 8.5 If G is a connected Lie group then for any open neighborhood 
V C g of 0 the group generated by exp(F) is all of G. 
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sketch of proof. Since T e exp = id we have that exp is an open map near 0. 
The group generated by exp(V r ) is a subgroup containing an open neighborhood 
of e. The complement is also open. ■ 

Now we prove a remarkable theorem which shows how an algebraic assump- 
tion can have implications in the differentiable category. First we need some 
notation. 

Notation 8.1 If S is any subset of a Lie group G then we define 

S - 1 = {s" 1 : s € S} 


and for any x £ G we define 


xS = {xs : s £ 5}. 

Theorem 8.4 An abstract subgroup H of a Lie group G is a (regidar) subman- 
ifold iff H is a closed set in G. If follows that H is a (regular) Lie subgroup of 

G. 


Proof. First suppose that H is a (regular) submanifold. Then H is locally 
closed. That is, every point x £ H has an open neighborhood U such that 
U fl H is a relatively closed set in H. Let U be such a neighborhood of the 
identity element e. We seek to show that H is closed in G. Let y £ H and 
x £ yU -1 fl H. Thus x £ H and y £ xU. Now this means that y £ H fl xU, and 
thence x~ x y £ H fit/ = H CiU. So y £ H and we have shown that H is closed. 

Now conversely, let us suppose that H is a closed abstract subgroup of G. 
Since we can always use the diffeomorphism to translate any point to the identity 
it suffices to find a neighborhood U of e such that U fl H is a submanifold. The 
strategy is to find out what Lie(H) = f) is likely to be and then exponentiate a 
neighborhood of e £ f). 

First we will need to have an inner product on T e G so choose any such. 
Then norms of vectors in T e G makes sense. Choose a smalHreighborhood U of 
0 £ T e G = g on which exp is a diffeomorphism say exp :[/—>[/ with inverse 
denoted by logy. Define the set H in U by H = log V (H fl U). 

Claim 8.1 If h n is a sequence in H converging to zero and such that u n = 
hn/ \h n \ converges to v £ g then exp (tv) € H for all i£R. 

Proof of claim: Note that th n /\h n \ — > tv while \h n \ converges to zero. But 
since \h n \ — > 0 we must be able to find a sequence k(n) £ Z such that k(n) \h n \ — > 
t. From this we have exp (k(n)h n ) = exp (k(n) \h n \ jTpj) — > exp(fi>). But by the 
properties of exp proved previously we have exp (k(n)h n ) = (exp (h n )) k ( n \ But 
exp (h n ) £ HflU C H and so (exp (h n )) k ^ £ H . But since H is closed we have 
exp(fu) = lim ?woo (exp(/i„)) fe(n) € H. 

Claim 8.2 The set W of all tv where v can be obtained as a limit h n / \h n \ — > v 
with h n £ H is a vector space. 
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Proof of claim: It is enough to show that if h n / \h n \ — > v and h' n / \h' n \ — > w 
with h' n , h n £ H then there is a sequence of elements ft" from H with 


K/K 


V + w 
|u + w\ 


This will follow from the observation that 


h(t ) = log c/ (exp(to) exp(tio)) 
is in H and by exercise 5.5 we have that 


lim h(t)/t = v + w 

tj.0 


and so 


h(t)/t v + w 

\h(t)/t\ |i> + w| 


The proof of the next claim will finish the proof of the theorem. 


Claim: Let W be the set from the last claim. Then exp(VF) contains an open 
neighborhood of e in H. Let W 1 - be the orthogonal compliment ofW with respect 
to the inner product chosen above. Then we have T e G = W 1 - © W. It is not 
difficult to show that the map S : W © W 1 - — » G defined by 

v + w i— > exp(i>) exp (to) 

is a diffeomorphism in a neighborhood of the origin in T e G. Now suppose that 
exp(W) does not contain an open neighborhood of e in H. Then we can choose 
a sequence (v n ,w n ) £ W © W - 1 with (v n ,w n ) — > 0 and exp(w n ) exp(w n ) £ H 
and yet w n 0. The space W 1 - and the unit sphere in W 1 - is compact so 
after passing to a subsequence we may assume that w n /\w n \ — > w £ W ± and 
of course |u>| = 1. Since exp(v n ) £ H and H is at least an algebraic subgroup, 
exp(iVi.) exp(w„) £ H , it must be that exp (w n ) £ H also. But then by the 
definition ofW we have that w £ W which contradicts the fact that |w| = 1 and 
w £ W- 1 . 


8.1 Spinors and rotation 

The matrix Lie group SO( 3) is the group of orientation preserving rotations of 
JR 3 acting by matrix multiplication on column vectors. The group SU(2) is the 
group of complex 2x2 unitary matrices of determinant 1. We shall now expose 
an interesting relation between these groups. First recall the Pauli matrices: 


1 

0 



o 1 = 


0 

1 



O 2 = 


0 

—i 



, ©3 = 


<Jo = 


1 0 
0 -1 
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The real vector space spanned by a -\ , a-i , <73 is isomorphic to R 3 and is the 
space of traceless Hermitian matrices. Let us temporarily denote the latter 
by R 3 . Thus we have a linear isomorphism R 3 — > R 3 given by (a; 1 , x 2 , x 3 ) 1 — > 
adoy + x 2 (J 2 + x 3 (J 3 which abbreviate to x 1 — > x. Now it is easy to check that 
det(x) is just — |a?| 2 . In fact, we may introduce an inner product on R 3 by 
the formula (x,y) := — | tr(xy ) and then we have that x 1 — > x is an isometry. 

Next we notice that SU( 2) acts on R 3 by (g,x) 1 — > gxg- 1 = gxg* thus giving 
a representation p of SU(2) in R 3 . It is easy to see that (p(g)x, p{g)y) = (x,y) 
and so under the identification R 3 <-> R 3 we see that SU( 2) act on R 3 as an 
element of 0(3). 


Exercise 8.1 Show that in fact, the map 517(2) — > 0(3) is actually a group 
map onto SO( 3) with kernel {±7} = Z 2 . 


Exercise 8.2 Show that the algebra generated by the matrices <to, *<Ji, *02, *03 
is isomorphic to the quaternion algebra and that the set of matrices io\, iu-i, *03 
span a real vector space which is equal as a set to the traceless skew Hermitian 
matrices su( 2 ) . 


Let I = iu\ , J — io 2 and ia 3 = K. One can redo the above analysis using 
the isometry R 3 — > su(2) given by 

(a: 1 , a; 2 , x 3 ) ^ x 3 I + x 2 J + x 3 K 
x 1 — ► x 

where this time (x,y) := \tr(xy*) = — \tr(xy ). Notice that su(2) = span{I, J, K} 
is the Lie algebra of SU{ 2) the action (g,x) 1 — > gxg- 1 = gxg* is just the ad- 
joint action to be defined in a more general setting below. Anticipating this, 
let us write Ad(g) : x 1 — > gxg*. This gives the map g 1 — > Ad(g)\ a Lie group 
homomorphism SU(2) —> SO( su( 2 ),(,)). Once again we get the same map 
SU( 2) — > 0(3) which is a Lie group homomorphism and has kernel {±7} = Z 2 . 
In fact, we have the following commutative diagram: 

SU( 2) = SU{ 2) 

Pi Adi 

50(3) - 50(su(2), (, )) 

Exercise 8.3 Check the details here! 

What is the differential of the map p : SU(2) — » 0(3) at the identity? 
Let g(t) be a curve in 517 ( 2) with fjf\ t _ 0 g = g' ■ We have ^(g(t)Ag*(t)) = 
(i L t 9 (t))Ag*(t ) + g(t)A(Ag(t))* and so the map ad : g' ^ g'A + Ag'* = [g ' , A] 
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= J^2 tr{9x{9W) 
l -tr([g',^(y)*) + ltr(x,([g',W) 

== \tr([g',x\,(y)*) - ^tr(x, [ g' ,y\ ) 

= ([g',x\,y) - (x, [ g',y] ) 

= .(ad(g')x, y) - ( x,ad(g')y) 

From this is follows that the differential of the map 517(2) — > 0(3) takes su(2) 
isomorphically onto the space so (3). We have 

su(2) = su(2) 

dp l ad | 

50(3) S so(su(2), (, )) 

where so(su(2), (, )) denotes the linear maps su(2) — > su(2) skew-symmetric with 
respect to the inner product (x, y) := ^ tr(xy *). 



CHAPTER 8. LIE GROUPS II 



Chapter 9 

Multilinear Bundles and 
Tensors Fields 


A great many people think they are thinking when they are merely 
rearranging their prejudices. 

-William James (1842-1910) 

Synopsis: Multilinear maps, tensors, tensor fields. 


9.1 Multilinear Algebra 

There a just a few things from multilinear algebra that are most important 
for differential geometry. Multilinear spaces and operations may be defined 
starting with in the category vector spaces and linear maps but we are also 
interested in vector bundles and their sections. At this point we may consider 
the algebraic structure possessed by the sections of the bundle. What we have 
then is not (only) a vector space but (also) a module over the ring of smooth 
functions. The algebraic operation we perform on this level are very similar to 
vector space calculations but instead of the scalars being the real (or complex) 
numbers the scalars are functions. So even though we start by defining things 
for vector spaces it is more efficient to consider modules over a commutative 
ring R since vector spaces are also modules whose scalar ring just happens to 
be a field. We do not want spend too much time with the algebra so that we 
may return to geometry. However, an appendix has been included that covers 
the material in a more general and systematic way that we do in this section. 
The reader may consult the appendix when needed. 

The tensor space of type (r, s) is the vector space T r | s (V) of multilinear 
maps of the form 


Y:V*xV*---xV*xVxVx---xV— >W. 

' V ' ' v ' 

r — times s — times 
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There is a natural bilinear operation called the (consolidated 1 ) tensor product 
T r ' | 51 (V) x T% a (V) - T ri+r2 | Sl+S2 (V) defined for T, G T' | Si (V) and 
T 2 G T r2 \ S2 (V) by 

(Ti ® T 2 )(ai, ...,a r ,vi , ..., v s ) 

:= T 1 (a 1 , a r )Y 2 (vi , ..., v s ) 

We may put the various types together into direct sum which is a sort of master 
tensor space. 

r,s 

Tq (V) is an algebra over R. with the tensor product. In practice, if one is 
interested in an algebra that is useful for constructing other algebras then the 
smaller space T$(V) = | will usually do the job. It is this later space that 

is usually refereed to as the tensor algebra books and simply denoted T(V). 

Tensors and the tensor algebra is often defined in another way that is equiv- 
alent for finite dimensional vector spaces. The alternate approach is based on 
the idea of the tensor product of two vector spaces. The tensor product V <g> W 
of vector spaces V and W is often defined in an abstract manner as the universal 
object in the category whose objects are bilinear maps VxW-tE and where 
a morphism between and object V x W — > Ei and an object V x W — > E 2 is a 
map Ei — > E 2 which make the following diagram commute: 

V x W -► Ei 

\ I 

E 2 

On the other hand it is quite easy to say what V ® W is in practical terms: 
It is just the set of all symbols v ® w where v G V and w G W and subject to 
the relations v ® w 

Our working definition comes out of the following theorem on the existence 
of the universal object: 

Theorem 9.1 Given finite dimensional vector spaces V and W there is a vector 
space denoted V ® W together with a bilinear map u : V x W — > V ® W has the 
following property: 

Definition 9.1 (Universal property) Given any bilinear map : V x W — > E, 
where is another finite dimensional vector space, there is a unique linear map 
b : V ® W — > E such that the following diagram commutes: 

V x W 4 E 

A 

V® w 

1 The word “consolidated” refers to the fact that we let the contravariant slots and the 
covariant slots slide past each other so that a consolidated tensor product would never give a 
multilinear function like ^ X V* X V -> M. Some authors feel the need to do it otherwise but 
we shall see how to avoid involving shuffled spaces like T r s k kl (V) for example. 
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There is an obvious generalization of this theorem which gives the existence 
of a universal object for k — multilinear maps ViX---xVfc— >E. Note that the 
spaces Vi, Vfc are fixed but the E may vary. The universal space U (together 
with the associated map Vi x x Vt ^ 17 is called the tensor product of 
Vi, Vfc and is denoted V 1 0 • • • 0 Vfc. 

Now that we have the existence of the tensor product we make the following 
practical observations: 

1. The image of the map it:V xW-*V®Wisa spanning set for V 0 W 
so if we denote the image of a pair (v, w) by v®w then every element 
of V 0 W is a linear combination of elements of that form. These special 
elements are called simple or indecomposable. 

2. Suppose we want to make a linear map from V 0 W to some vector space 
E. Suppose also that we know what we would like the linear map to do to 
the simple elements. If the rule we come up with is bilinear in the factors 
of the simple elements then this rule extends uniquely as a linear map. 
For example, consider V 0 Hom(V, E) and the rule v§A-» Av. This rule 
is enough to determine a unique linear map V 0 Hom(V, E) — » E which 
agrees with the prescription on simple elements (the set of simple elements 
which is probably not even a vector space) . We will call this the extension 
principal. 

3. If {vi, ..., v m } is a basis of V and {wi, ..., w„} a basis of W, then a basis of 
V® W is the doubly indexed set {vj0Wj : 1 < i < n, 1 < j < m}. Thus a 
typical element Y G V 0 W has the expansion T = ^a* J Vj0Wj for some 
real numbers a l L Similarly, for T € V 0 V 0 V* we have 

T = ^« ,j fc Vj0Vj0td 

where {u 1 ,...,?;"} is the basis for V* which is dual 2 to {v 1; ...,v m }. In 
general, a basis for V 0 • • • V 0 V* 

Exercise 9.1 Show that the definition of tensor product implies that the simple 
elements span all of V 0 W. The uniqueness part of the definition is key here. 

Exercise 9.2 Show that the extension principle follows directly from the defini- 
tion of the tensor product and is essentially just a reformulation of the defining 
properties. 

So now we need to think about a few things. For instance, what is the 
relation between say T 1 \ 1 (V) = L(V*,V) and V* 0 V or V0 V*. What about 
V* 0 V and V 0 V*. Are they equal? Isomorphic? There is also the often 
employed relationship between W 0 V* and L(V, W) to wonder about. Let us 
take a quick tour of some basis relations. 

2 So we have v l (vj) = <5*. 
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This one is a natural inclusion given by v i— > (v : V* — > R) where v : 
ai— >a(v). For finite dimensional vector spaces this is an isomorphism. 
This is also true by definition for reflexive Banach spaces and only in this 
case will we be able to safely say that T 1 | 0 (V) = V unless we simply 
modify definitions (some authors do just that). 


2 . 


L(V, W; R) = L(V ®W,K) = (b« W)* 


This one is true by the definition of tensor product V ® W given above. 
It says that the bilinear maps VxW-+K are in 1-1 correspondence with 
linear maps on a special space called the tensor product of V and W. The 
obvious generalization of this one is T(Vi, V r ; E) = L(Vi® ■ • - ®V r ,E). 
Also, a special case worth keeping in mind is L(V*, V; R) = (V* ® V)*. 


3. 


V® V* ^ L(V*,V;R) 


This one is curious. It is another natural isomorphism that does not 
generalize well to infinite dimensions. The isomorphism goes like this: 
Let v ® a be the image of (v,a) £ V x V* under the universal maps 
as described above. Elements of this form generate all of V ® V* and 
specifying where these elements should go under a linear map actually 
determines the linear map as we have seen. In the current case we use 
v ® a to define an element i(v ® a) € L(V*,V;R): 


i(v 0 a)(f 3, w) := /3(v)a( w). 


This isomorphism is a favorite of differential geometers but if one plans 
to do differential geometry on general Banach spaces the isomorphism is 
not available. Sometimes one can recover something like this isomorphism 
but it takes some work and by then it is realized how small the role of this 
actually was after all. One can often do quite well without it. 

There is something crucial to observe here. Namely, under the identifi- 
cation of V**with V, the isomorphism above just sends v 0 a £ V 0 V* 
to the tensor product of the two maps v : V*— > R and a : V— > R which is 
also written v0a the result being in T 1 1 (V) = L(V*, V; R). The point is 
we already had a meaning for the symbol 0 in the context of multiplying 
multilinear maps. Here we have just seen that v 0 a € V 0 V* interpreted 
properly just is that product. Thus we will actually identify v ® a with 
c(v0a) and drop the t completely. Conclusion: v®a is a multilinear map 
in either case. 


4 . 


W® V* =* L(V,W) 
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This last natural isomorphism is defined on simple elements using the 
formula 

t(v ® a)(/3) • w := a(w)v 

which then extends to all elements of V ® W* by the extension principal 
described above. For reasons similar to before, the map r(v ® a) will be 
identified with v®a and now v®a has yet another interpretation. Tensors 
generally have several interpretations as multilinear maps. For example, if 
T : V x W x W* -+ I is a multilinear map into R. then T :vmT(v, ., .) 
defined a multilinear map V — + L( W, W*). 

Exercise 9.3 Let V and W be finite dimensional. Prove that (V ®> W)* is 
naturally isomorphic to V* ® W*. Note that what makes this work is the fact 
that V** may be identified with V. 

Exercise 9.4 Prove that for finite dimensional vector spaces V®V* L(V* , V;R) 
described above is an isomorphism. 

9.1.1 Contraction of tensors 

Consider a tensor of the form vi ® V 2 <S> ry 1 <E> rj 2 £ T|(V) we can define the 1, 1 
contraction of Vi ® V 2 0 rfi ® rj 2 as the tensor obtained as 

C\(v i <8) V 2 ® jy 1 ® rj 2 ) = rfijv i)v 2 ® rj 2 . 

Similarly we can define 

c \ (vi ® v 2 ® 7? 1 ® ry 2 ) = ry 2 (v!)v 2 ® ry 1 . 

In general, we could define Cj on “monomials” vi ® v 2 . . . ® v r ® ry 1 ® ?y 2 ® . . . ® ry s 
and then extend linearly to all of XJ (V). This works fine for V finite dimensional 
and turns out to give a notion of contraction which is the same a described in 
the next definition. 

Definition 9.2 Let {ei,...,e„} C N be a basis for V and {c 1 . ..., c ." } C V* the 
dual basis. If r G TJ(V) we define C't € Tfzj(V) 

C} T (0 1 ,...,0 r ~ 1 , wi,..,w s _i) 

n 

= e k ., ...,0 r-1 ,wi,..., e fc ,...,w s _i). 

, ; j~th 

fc= i j 

It is easily checked that this definition is independent of the basis chosen. In 
the infinite dimensional case the sum contraction cannot be defined in general 
to apply to all tensors. However, we can still define contractions on linear 
combinations of tensors of the form vi ® v 2 ... ® v r ® rj 1 ® ?y 2 ® ... ® rj s as we did 
above. Returning to the finite dimensional case, suppose that 

t = T h ® ••• ® e *r- ® e jl ® ... ® e is . 
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Then it is easy to check that if we define 

where the upper repeated index k is in the i - th position and the lower occurrence 
of k is in the j - th position then 

CjT = ^2 ® "• ® e v- i e31 <S> ••• ® e 03 - 1 . 

Even in the infinite dimensional case the following definition makes sense. The 
contraction process can be repeated until we arrive at a function. 

Definition 9.3 Given r € TJ and a — wj g ... g wj g r/ 1 ^rj 2 g ... g r] m £ T l m 
a simple tensor with l < r and m < s, we define the contraction against a by 

u jt(qu , . .. , ot r —i , v^ , . .. , \ r —i ) 

:= C(t (g) (wi <g> ... g wi g r] 1 (g rj 2 <S» ••• g »? m )) 

:= r(?7 1 , ...,77 m ,ai, a r _j, wi, ...,w;,V!, ...,v r _j). 

For a given simple tensor a we thus have a linear map er_i : TJ — > Tfz2- F° r 
finite dimensional V this can be extended to a bilinear pairing between T ,l m and 

r 

ri(v) x t;(v) ^ Tjri(v). 

Exercise 9.5 Show that all the different interpretations of a tensor may be re- 
alized by tensoring with a some number of variable tensors and then contracting. 
Hint: What would the following signify? 

T : ( vi,a l ) i— > m T ife | a* (sum over i and l) 

Extended Matrix Notation. Notice that if one takes the convention that 
objects with indices up are “column vectors” and indices down “row vectors” 
then to get the order of the matrices correctly the repeated indices should read 
down then up going from left to right. So Ajeie J should be changed to ejAje J 
before it can be interpreted as a matrix multiplication. 

Remark 9.1 We can also write Af AjA J L , = A(f, where Ag, = e R e' s . 

9.1.2 Alternating Multilinear Algebra 

In this section we make the simplifying assumption that all of the rings we use 
will have the following property: The sum of the unity element with itself; 1 + 1 
is invertible. Thus if we use 2 to denote the element 1 + 1 then we assume the 
existence of a unique element “1/2” such that 2 • 1/2 = 1. Thus, in the case 
of fields, the assumption is that the field is not of characteristic 2. The reader 
need only worry about two cases: 


n 



k= 1 
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1. The unity “1” is just the number 1 in some subring of C (e.g. R or Z) or 

2. the unity “1” refers to some sort of function or section with values in a 
ring like C, R or Z which takes on the constant value 1. For example, in 
the ring C°°(M), the unity is just the constant function 1. 

Definition 9.4 A Z -graded algebra is called skew- commutative (or graded com- 
mutative ) if for ai £ Ai and aj € Aj we have 

at ■ aj = (— 1) uj • at 

Definition 9.5 A morphism of degree n from a graded algebra A = © igZ Ai 
to a graded algebra B = © igZ Bi is a algebra homomorphism h : A — » B such 
that h(Af) C Bi +n . 

Definition 9.6 A super algebra is a Z 2 - graded algebra A = Aq © A\ such that 
Ai ■ Aj C A i+ j mo d 2 and such that a.i ■ aj = (— 1 ) kl aj ■ ai for i,j £ Z 2 . 


Alternating tensor maps 

Definition 9.7 A k-multilinear map a : W x • • • x W — >F is called alter- 
nating if a( wi,..,,Wfc) = 0 whenever w* = w j for some i ^ j. The space of 
all alternating k-midtilinear maps into F will be denoted by L k lt (W',F) or by 
Laiti W) if the ring is either R or C and there is no chance of confusion. 


Remark 9.2 Notice that we have moved the k up to make room for the Alt 
thusL k alt (W;R)cL° k (W;R). 

Thus if w G L k lt (Y), then for any permutation a of the letters 1, 2 , k we 
have 

w(wi,w 2 , W fc ) = sgn(a)u( w^ , w CT 2 , w CT J. 

Now given ui £ L r alt (V) and 77 e L lit (V) we define their wedge product or 
exterior product A 77 € T^ S (V) by the formula 


u A ? 7 (v i; v r ,v r+ i, ...,v r+s ) := 


■ ! s ! 


sgn (a)w(v 0 


-M' 


°’r+ 15 **•> C^r+s 


or by 


u> A ? 7 (“same as above”) := ^ sgn(a)w(v CTl , v 0 . r .)? 7 (v (Tr+ 1 , v^J. 

r,s — shuffles a 

In the latter formula we sum over all permutations such that (Ti < cr 2 < .. < cr r 
and ay+i < tr r _|_ 2 < .. < <r r+s . This kind of permutation is called an r, s — shuffle 
as indicated in the summation. The most important case is for co,g £ L l alt (y ) 
in which case 

(w A 77 ) (v, w) = co(v)g(w) — Lo(w)rj(v) 
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This is clearly a skew symmetric multi-linear map. 

If we use a basis e 1 , e 2 , e n for V* it is easy to show that the set of all 
elements of the form e n A e 12 A • • • A e lk with i\ < ii < ... < ik form a basis for 
. Thus for any u> G A k (V) 

w = ^2 Ae 12 A--- Ae lk 

ii<i2<---<ik 

Remark 9.3 In order to facilitate notation we will abbreviate as sequence of 
k-integers *i, * 2 , from the set {1, 2, ..., dim(V)} as I and e 11 A e 12 A ■ ■ ■ A £ lk 
is written as e 1 . Also, if we require that i\ < *2 < ... < ik we will write I . We 
will freely use similar self explanatory notation as we go along with out further 
comment. For example, the above equation can be written as 

0J = Y, a r £l 

Lemma 9.1 L k lt (Y) = 0 if k > n = dim(V). 

Proof. Easy exercise. ■ 

If one defines L° alt {Y) to be the scalars K and recalling that L\ lt {V) = Y* 
then the sum 

dim(M) 

WV)= 0 L k alt (V) 

k = 0 

is made into an algebra via the wedge product just defined. 

Proposition 9.1 For u> € L r alt (V) and p € L s alt (V) we have wA?] = (— l) rs p A 

The Abstract Grassmann Algebra 

We wish to construct a space that is universal with respect to alternating mul- 
tilinear maps. To this end, consider the tensor space T k ( W) := W fc ® and let A 
be the submodule of T k (W) generated by elements of the form 

Wl ® • Wj ® - ■ ■ ® Wj ■ ■ ■ ® Wfc. 

In other words, A is generated by decomposable tensors with two (or more) 
equal factors. We define the space of k- vectors to be 

WA-AW:=/\‘W:= T k { W) /A. 

Let Afc : W x • • • x W — » T k { W) — * ^ fc W be the canonical map composed with 
projection onto /\ fc W. This map turns out to be an alternating multilinear 
map. We will denote A^(wi, ..., w k) by wi A • • ■ A w k- The pair (/\ A W, A*,) is 
universal with respect to alternating /c- multilinear maps: Given any alternating 
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^-multilinear map a : W x • • • x W — > F, there is a unique linear map a A ; 
/\ k W — > F such that a = a A oAj; that is/\ A 

W x ••• x W F 

Afc I / a A 

A fc w 

commutes. Notice that we also have that wiA- • -Aw k is the image of wj®- • 
under the quotient map. Next we define /\W := J2T=o W and impose the 
multiplication generated by the rule 

a i 

(w! A • • • A w,) x (w'j A • • • A w' ) ^w 1 A"'Aw i Aw' 1 A' ,, Aw' £ /\ W. 

The resulting algebra is called the Grassmann algebra or exterior algebra. If we 
need to have a Z grading rather than a Z + grading we may define f\ k W := 0 
for k < 0 and extend the multiplication in the obvious way. 

Notice that since (w + v) A (w + v) = 0, it follows that w A v = — v A w. 
In fact, any odd permutation of the factors in a decomposable element such as 
wi A • • • A Wfc, introduces a change of sign: 

wi A ■ ■ ■ A Wj A ■ ■ ■ A Wj A ■ ■ ■ A Wfc 

= — wi A • • • A Wj A • • • A w,; A • • • A Wfc 

Just to make things perfectly clear, we exhibit a short random calculation where 
the ring is the real numbers R: 

(2(1 + 2w + w A v + w A v A u) + v A u) A (u A v) 

= (2 + 4w+2w A v+2w A v A u + v A u) A (— v A u) 

= — 2v Au — 4w A v A u — 2wAvAvAu — 2wAvAuAvAu — vAuAvAu 
= — 2v Au — 4wAvAu + 0 + 0 = — 2v A u — 4w A v A u. 

Lemma 9.2 //V is has rank n, then f\ k V = 0 for k > n. If f \ , ..... f n is a 
basis for V then the set 

{fi i A • • • A f ik : 1 < h < ■ ■ ■ < i k < n} 

is a basis for f \ k V where we agree that / tl A • • ■ A fi k = 1 if k = 0. 

The following lemma follows easily from the universal property of a A : 
/\ k W — > F: 

Lemma 9.3 There is a natural isomorphism 

L k u (W ; F) = L(/\^ W ; F) 


In particular, 
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Remark 9.4 (Convention) Let a £ L k lt ( W;F). Because the above isomor- 
phism is so natural it may be taken as an identification and so we sometimes 
write a(vi, ... ,v k ) as a(v\ A • • • A Vk )• 

In the finite dimensional case, we have module isomorphisms 
/\ fe W*-(7\ fe W ) — L k lt (W) 
which extends by direct sum to 

LaitifN) — /\ W* 

which is in fact an algebra isomorphism (we have an exterior product defined 
for both). 

The following table summarizes: 


Exterior Products 

A fc w 

!\ k w* s w) 


Isomorphisms that hold 
in finite dimension 


I 


Alternating multilinear maps 


L k alt { W) 


A w = ® A fc w 

, graded algebra iso. 

A w * = © fc A w* = 


A(W) = ® k L k alt (W) 


9.1.3 Orientation on vector spaces 

Let V be a finite dimensional vector space. The set of all ordered bases fall into 
two classes called orientation classes. 


Definition 9.8 Two bases are in the same orientation class if the change of 
basis matrix from one to the other has positive determinant. 


That is, given two frames (bases) in the same class, say (fi , .../„) and 

(fir -Jn) with 


fi = fjCl 


then det C > 0 and we say that the frames determine the same orientation. The 
relation is easily seen to be an equivalence relation. 


Definition 9.9 A choice of one of the two orientation classes of frames for a 
finite dimensional vector space V is called an orientation on V. The vector 
space in then said to be oriented. 

Exercise 9.6 Two frames, say (fi,...,f n ) and (fi,...,f n ) determine the same 
orientation on V if and only if fi A ... A f n = a /i A ... A /„ for some positive 
real number a > 0. 
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Exercise 9.7 If a is a permutation on n letters {1,2, then (f a i, ■ ■■, fan) 
determine the same orientation if and only if sgn^a) = +1. 

A top form u € L" zt (V) determines an orientation on V by the rule (/j, f n ) ~ 

(7i, •••, In) if and only if 


Furthermore, two top forms Wi, W 2 € L™ lt (V) determine the same orientation on 
V if and only if u>\ = au >-2 for some positive real number a > 0. 


9.2 Multilinear Bundles 

Using the construction of Theorem ?? we can build vector bundles from a 
given vector bundle in a way that globalizes the usual constructions of lin- 
ear and multilinear algebra. Let us first consider the case where the bundle 
has finite rank. Let E —* M be a vector bundle with a transition functions 
$ a p : U a r\Up — > GL(R”). We can construct vector bundles with typical fibers 
R n *, L(R n ,R n ), Tf(R n ), A fc R n , and /\ k R n *. All we have to do is choose the 
correct transition functions. They are the following: 

1. For R n * use the maps <&* a p : U a (~l Up — > GL(R n *) where $* a p{x) = 

2. For L(R",R") = Hom(K n , K") use the maps Hom($ a 0 ) : U a fl Up — > 
GL(L(K”,K”)) given by 

(Hom(^ a p)(x))(A) = $ a p(x) ° A o ^^(x) -1 

3. For TJ (R n ) use Tf$ a p : U a nUp — > GL(TJ (K")) given on simple tensor(?)s 
by 

Tf$ a p(x)(v i ® • • • (g> a s ) = d’a/3(a;)fi ® • • • <8 < f>* /3 (a;)a s . 


4. For /\ fc ]R” use /\ k § a p : U a fl Up —* GL(/\ fc (K")) given on homogeneous 
e(?)lements by 

A / ' < f > a/3(a:)(wi A • • • A v k ) = <L a p(x)v i A • • • A 

In this way we construct bundles which we will denote variously by E* —> 
M, Horn (E, E) -► M, Tf(E) -> M, /\ k E M and so on. 

9.3 Tensor Fields 

The space of tensors of type ( r ) at a point x G M is the just the tensor space 
T r s (T p M) based on T p M (and T*AI ) as discussed in the last section. In case 
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M is finite dimensional we may make the expected identification via the natural 
isomorphisms: 

T r s (T p M) = (: T p M® r ) 0 (! T*M® S ) = (g) r s (T p M) = (T p M® r ) 0 (T*M® S ) 


In a local coordinate frame (ad, ...,x n ) we have an expression at p for elements 
of TJ (T p M), say A p , of the form 


A P = Y, Ail ~ 


d 

3a;* 1 


d 
dx *’ 


0 dad 1 1 0 • • • 0 dad s 


which will abbreviated to 


Mp) = E A 1 j(p)dx i 0 da; J at p 

A assignment of an element A p £ TJ (T p M) for each p £ U C M is called 
tensor field over U and we will normally require that the assignment be made 
smoothly in the sense determined by the next two definitions. 


Definition 9.10 Let the disjoint union of all the spaces Tf(T p M) over the 
points in M be denoted by Tf(TM). This set can be given structure of a smooth 
vector bundle in a manner similar to the structure on TM and is called the (r, s)- 
tensor bundle. (Recall the discussion in the section 9.2 concerning operations 
on vector bundles.) In case M is finite dimensional , we also have the bundle 
(g)((TM) and the natural bundle isomorphism 

(g f s (TM) £* TJ(TM). 

Definition 9.11 A (smooth) tensor field is a section of the vector bundle 
TJ(TM) — > M. The set of all smooth tensor fields is denoted T r s (M). We can 
similarly talk of tensor fields defined on an open subset U of M and so we have 
the space T £(£/) of tensor fields T r s (M) over U. 

The assignment U i— > is a sheaf of modules over C°°. We also have 

the alternative notation Xm(U ) for the case of r = 0, s = 1. 


Proposition 9.2 Let M be an n-dimensional smooth manifold. A map A : 
U — > TJ (M) of the form x i— > A(p) = A p £ TJ (T p M) is a smooth tensor field 
in U if for all coordinate systems (x 1 ,...,x n ) defined on open subsets of U the 
local expression 

A = ^ A J jdxi 0 dx J 
is such that the functions Aj are smooth. 

Now one may wonder what would happen if we considered the space X(f7) 
as a module over the ring of smooth functions C°°(U ) and then took tensors 
products over this ring of r copies of 3Cm(U) and s copies of Xm(U)* to get 


X M (U) 0 • • • 0 X M (U) 0 X M {U)* 0 • • • 0 x M (uy. 


(9.1) 
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Of course, this space is equivalent to a space of multilinear module morphisms 
but furthermore this turns out to be naturally equivalent to the space of sections 
T r s (U) if the manifold is finite dimensional. The proof is an extension of the 
proof of Theorem 7.2. 

Definition 9.12 We can define contraction of tensor fields by contracting on 
each tangent space and if the manifold is infinite dimensional then not all tensors 
can be contracted. 

In any case, we can define contraction against simple tensors. For example, 
let T G T § (C/) , a G Xm{U)* and X € %m(U). Then X 0 a G T \(U) and we 
have ( X ® a)jT € T\(U) given by 

((X®a)_.T)(a 1 ,X 1 ,X 2 ) = C(T ® {X ® a)) = T(a,a 1 ,X,X 1 ,X 2 ) 

Definition 9.13 A k— covariant tensor field T is called symmetric of for all 
k-tuples of vector fields (X\, ...,X k ) and all permutations of k-letters a we have 

r(x al ,...,x ITk ) = r(x 1 ,...,x k ). 

We can similarly define symmetric contravariant. tensors. 

Proposition 9.3 Let {Ui,ifi} be an atlas for a smooth (or C r ) manifold M 
with ipi : Ui — > ipi(Ui). Suppose that we have a smooth (or C r ) tensor field Tj 
of type (r, s) defined on each Ui and that {Ui,pi} is a smooth partition of unity. 
Then the following exists a smooth (or C r ) tensor field T of type (r, s) defined 
on M Furthermore if Tj > 0 for each i, then we can construct T of the same 
type as T, such that T > 0. 

Proof. Define a global tensor field for each i by extending pi Tj by zero 
(why is this smooth?). Now define 

t = E^' t t 

3 

This is well defined and smooth at each point of M since the sum is finite in 
some neighborhood of every point. Also, if Tj > flj for each i then if x G Ui we 
have that Pj( x ) = 1 ( a finite sum) so p k {x) > 0 for some k. Thus 

T ( x ) = ^ Pk{x)T k (x) > p k (x) > 0. 

3 


9.4 Tensor Derivations 

We would like to be able to define derivations of tensor fields . In particular 
we would like to extend the Lie derivative to tensor fields. For this purpose we 
introduce the following definition which will be useful not only for extending 
the Lie derivative but can also be used in several other contexts. Recall the 
presheaf of tensor fields U i— > T r s (U) on a manifold M. 
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Definition 9.14 A differential tensor derivation is a collection of maps T> r s \ v '■ 
T, r s {U) — > Tg(J7), all denoted by V for convenience, such that 

1. V is a presheaf map for considered as a presheaf of vector spaces over 
R. In particular, for all open U and V with V C U we have 

VT\ v = VT\ v 

for allT &T r s (U). 

2. V commutes with contractions against simple tensors. 

3. V satisfies a derivation law. Specifically, for Ti G 2I r s {U) and T 2 G Tj.(£/) 
we have 

2?(Ti <81 T 2 ) = VT 1 ® T 2 + Ti ® vr 2 . 

The conditions 2 and 3 imply that for T G T r s (U), a\ ,...,a r G X*(U) and 
X\, ...,X S G X(U) we have 

D(T(ai, ...,a r ,Xi,...,X s )) = VT(ai, ...,a r ,X i,...,X s ) 

T ^ ' T (oq , .... Do.i , ..., oc r , Xi , ..., X s ) 

i 

+ T(ai, ...,a r , X\, VXi, X s ). 

i 

This is follows by noticing that 

T(«i, a r , X 1: ..., X s ) = C(T ® (ai ® ■ ■■ ® a r ® X 1 ® • • • ® X s )) 

and applying 1 and 2. Also, in the case of finite dimensional manifolds (2) can 
be replaced by the statement that D commutes with contractions (why?). 

Proposition 9.4 Let M be a finite dimensional manifold and suppose we have 
a map on global tensors V : T r s (M) — > % r s (M) for all r,s nonnegative integers 
such that 2 and 3 above hold for U = M . Then there is a unique induced tensor 
derivation which are agrees with V on global sections. 

Proof. We need to define V : T r s (U) — > T r s (U) for arbitrary open U as a 
derivation. Let 5 be a function that vanishes on a neighborhood of V of p G U. 

Claim 9.1 We claim that (' D6)(p ) = 0. 


Proof. To see this let f3 be a bump function equal to 1 on a neighborhood 
of p and zero outside of V. Then S = (! — /?)<$ and so 


VS(p) = V((l-P)6)(p) 
5{p)V{l - /3) (p) + (1 - (3(p))V8(p) = 0 



9.4. TENSOR DERIVATIONS 


151 


Given r £ T r s (U ) let (3 be a bump function with support in U and equal to 
1 on neighborhood of p £ U. Then (3t € % r s (M) after extending by zero. Now 
define 

(Vr)(p) = D((3 t)(p). 

Now to show this is well defined let (3 2 be any other bump function with support 
in U and equal to 1 on neighborhood of po £ U. Then we have 

^(/3t)(po) - X>(/3 2 t)(po) 

= £>(/3t) - X>(/3 2 t))(po) = £>((/? - /3 2 )t)(po) = 0 

where that last equality follows from our claim above with 6 = /3 — /? 2 - Thus V is 
well defined on T r s (U). We now show that Vt so defined is an element of T r s (U). 
Let %l> a , U a be a chart containing p. Let : U a — > V a C R n . Then we can write 
r\ Ua £ T r s (U a ) as r Ua = 7 ^*’ " ' ® • • • <g> dx % * ® ® . We can use 

this to show that Vt as defined is equal to a global section in a neighborhood of 
p and so must be a smooth section itself since the choice of p £ U was arbitrary. 
To save on notation let us take the case r = 1, s = 1. Then Tjj a = rl-dxf ® 

Let (3 ne a bump function equal to one in a neighborhood of p and zero outside 
of U a n U. Now extend each of the sections /3rj £ $(U), (3dxi £ T?(f7) and 
(3-g^i £ t %o(U) to global sections and apply V to P 3 t\ v = f3T l jf3dx : > ® (3g |i to 
get 

= V((3 3 T) = V((3T}(3dxi®(3 ^-) 

O A 

= V{fai)pdx* ® (3— + (3rp((3dxi) ® (3— 

■ p Ur' <® D(/? A) 

Now by assumption V takes smooth global sections to smooth global sections so 
both sides of the above equation are smooth. On the other hand, independent 
of the choice of (3 we have V((3 3 r)(p) = V(t)(p) by definition and valid for all p 
in a neighborhood of po- Thus V(t) is smooth and is the restriction of a smooth 
global section. We leave the proof of the almost obvious fact that this gives a 
unique derivation V : T r s (U) — » T r s (U ) to the reader. ■ 

Now one big point that follows from the above considerations is that the 
action of a tensor derivation on functions, 1-forms and vector fields determines 
the derivation on the whole tensor algebra. We record this as a theorem. 

Theorem 9.2 Let V\ and V -2 be two tensor derivations (so satisfying 1,2, and 
3 above) which agree on functions, 1-forms and vector fields. Then V\ = I? 2 . 
Furthermore, ifVjj can be defined on 3(U) and X(U) for each open U C M so 
that 


1- Vuift&g) = Vjj f ® 5 + / ® V v g for all f,g £ $(U), 
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2. for each f G £(M) we have{V M f)\u = Du f\ v , 

3. V v (f <g> X) = Du f ®X + f® VuX for all f G $(U) and X G X(U), 

f. for each X G X(M) we have(T>MX) \ v = Du X\ v , 

then there is a unique tensor derivation D on the presheaf of all tensor fields 
that is equal to Du on 3(U) and X(U) for all U. 

Sketch of Proof. Define V on X*(U) by requiring T>u(oi ® X) = Dua ® 
X + a ® DuX 

so that after contraction we see that we must have (Dua)(X) = T>u(ce(X)) — 
a(DuX). Now using that D must satisfy the properties 1,2,3 and 4 and that we 
wish T> to behave as a derivation we can easily see how T> must act on any simple 
tensor field and then by linearity on any tensor field. But this prescription can 
serve as a definition. ■ 


Corollary 9.1 The Lie derivative Cx can he extended to a tensor derivation 
for any X G X(M). 

We now present a different way of extending the Lie derivative to tensors that 
is equivalent to what we have just done. First let T G T' S (U) If <f) : U — » <t>(U) 
is a diffeomorphism then we can define <f)*T G T g(U) by 

(0*T)(p)(ai, ..., a r , v 1 , ...,v s ) 

= T (0(p))(T V" 1 • ai, .... T*cf~ X ' Tcf ■ v 1 , ..., Tcf ■ v s ). 


Now if X is a complete vector field on M we can define 


£a-t 



(Flf*T) 


just as we did for vector fields. Also, just as before this definition will make 
sense point by point even if X in not complete. 

The Lie derivative on tensor fields is natural in the sense that for any dif- 
feomorphism (p : M — ► N and any vector field X we have 


£<l>.x <t>*T = 4>*C X T. 

Exercise 9.8 Show that the Lie derivative is natural by using the fact that it 
is natural on functions and vector fields. 



Chapter 10 

Differential forms 


Let M be a smooth manifold of dimension n = dim M. We form the natural 
bundle L k alt (TM) which has as its fiber at p the space L k lt (T p M). Let the 
smooth sections of this bundle be denoted by 

fl k (M)=T(M-L k alt (TM)). (10.1) 

and sections over U C M by This space is a module over the ring of 

smooth functions C°°(M) = T(U). We have the direct sum 

n M (U) = = r (u,ZnZ M Lalt(TM)) . 

which is a Z + -graded algebra under the exterior product 


(uj A g){p)(v 1 ,v 2 , ..,v r ,v r+1 ,v r+2 , -,v r+s ) 

= ) w W (t'a! . V - Vl'if V +1 > V+2 1 V+.) 

r,s — shuffles a 

for u> e Tl r M {U) and rj € fl s M (U). 

Definition 10.1 The sections of the bundle TIm{U) are called differential 
forms on U. We identify with the obvious subspace ofQ.M{U ) = 

fl k M (U). A differential form, in il k M {U) is said to be homogeneous of degree 
k and is referred to a “k-form”. 

Whenever convenient we may extend this to a sum over all n € Z by defining 
(as before) TL k M {U) := 0 for n < 0 and Sl^(f7) := 0 if n > dim(M). This is a 
Z-graded algebra under the exterior (wedge) product that is inherited from 
the exterior product on each fiber L k lt (T p M)-,we have 

(lo A g)(p)(vi,v 2 , ..,v r ,v r+ i,v r+ 2 , ..,v r+s ) 

= sgnfff )w(p) (Vai , v , - , ^Jfi(v +1 . ^r+2 . *V r+s ) 

r,s— shuffles a 
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Of course, we have made the trivial extension of A to the Z-graded algebra 
by declaring that u A 77 = 0 if either 77 or lo is homogeneous of negative degree. 

The assignment U i— > 12 m (17) is a presheaf of modules over C Similar 
remarks hold for il k M the (presheaf of) homogeneous forms of degree k. Sections 
from 1 2 (M) are called (global) differential forms or just forms for short. 

Just as a tangent vector is the infinitesimal version of a curve through a 
point so a 

10.1 Pullback of a differential form. 

Given any smooth map / : M — •> N we can define the pullback map f* : 

1 1(N) — > 1 2(M) as follows: 

Definition 10.2 Let 77 £ 1 l k (N). For vectors v ±, ..., £ T p M define 

(f*v) (P)(vi,-,v k ) = {T p Vh ..., T p Vk) 

then the map f*r] : p — > (/*??) (p) is a differential form on M. f*rj is called the 

pullback of 77 by f. 

One has the following easy to prove but important property 
Proposition 10.1 With f : M — » N smooth map and 771,772 G Sl(N) we have 

f*(Vi A 772 ) = f*rji A f * 772 

Proof: Exercise 

Remark 10.1 Notice the space is just the space of smooth functions 

C°° (U) and so unfortunately we have several notations for the same set: C°° (U ) = 

c%(u) = f M (u) = n° M (u). 

All that follows and much of what we have done so far works well for 12 j v/(17) 
whether U = M or not and will also respect restriction maps. Thus we will 
simply write 12 m instead of 12 m(U) or 12 (M) and X M instead of X(27) so forth 
(recall remark 6.1). In fact, the exterior derivative d commutes with restrictions 
and so is really a presheaf map. 

The algebra of smooth differential forms 12(17) is an example of a Z graded 
algebra over the ring C°°(U) and is also a graded vector space over R. We have 
for each U C M 

1 ) a the direct sum decomposition 

12(17) = • • • © 12~ 1 (f7) © 12° (17) © 12 1 (f7) © 12 2 (17) • • • 

where 12 k (U) = 0 if k < 0 or if k > dim(17); 

2) The exterior product is a graded product: 

a A p £ O k+l {U) for a £ n k {U) and (3 £ O l {U) 

which is 

3) graded commutative: a A (3 = (— \) kl (3 A a for a £ Ll k (U) and (3 £ Ll l (U). 
Each of these is natural with respect to restriction and so we have a presheaf 

of graded algebras. 



156 


CHAPTER 10. DIFFERENTIAL FORMS 


10.2 Exterior Derivative 

Here we will define and study the exterior derivative d. 

Definition 10.3 A graded derivation of degree r on f 1 := Dm is a sheaf map 
D : Q, — > 0, such that for each U C M , 

v : n k {u) -► n k+r {u ) 

and such that for a € Ll k (U) and {3 € 0(U) we have 

V(a A /?) = Va A /? + {-l) kr a A V(3. 

Along lines similar to our study of tensor derivations one can show that a 
graded derivation of 0(U) is completely determined by, and can be defined by 
it action on 0-forms (functions) and 1-forms. In fact, since every form can be 
locally built out of functions and exact one forms, i.e. differentials, we only need 
0-forms and exact one forms to determine a graded derivation. 

The differential d defined by 

df(X) = Xf for X gX m (10.2) 

is a map — > IP M . This map can be extended to a degree one map from 

the graded space II m to itself. Degree one means that writing d : 0° M — > If' M 
as do : 0° M — > Il [ M we can find maps di : IV M —> that will satisfy our 

requirements. 

Let a>u : U — » L k lt ( M; M). In the following calculation we will identify 
L k lt ( M; M) with the L(A fc M, M). For £ 0 > ...,£ fe maps £ * : U — > M we have 

D(u)u,£ o, ■ £» 

= 4 + <(.). Co(i + <(.) A ... A A ... A ( t (a; + t(,)) 

dt „ 

= (uu(x), ^ [£o(z + tfi) A ... A A ... A £ fc (x + t£;)]) 

+ ( 4 wt/(a;),£o(a: + f&) A ... A£ A ... A £ fc (:r + t&)) 

dt o 

2-1 

= {uu(x), X^(- 1 ) J ^'( a; )^ A A ••• A C' A ••• A & A ••• A &(*)]) 

i=o 

k 

+ (wc/(a;), ^ (-l)' '£,(./■)£, A [£ 0 (x) A ... A£ A ... A £j A ... A £ fc (a;)]) 
l=*+i 

+ (w[rW?i, £o0*0 A ••• A £i A ••• A £fc(z)} 
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Theorem 10.1 There is a unique graded (sheaf) map d : Dm — > Om, called 
the exterior derivative , such that 

1) d o d = 0 

2) d is a graded derivation of degree one, that is 


d(a A ft) = (da) A (3 + (— l) fe a A (d/3) (10.3) 


for a&D k M . 

Furthermore, if ui G D k (U) and Xq, X \, ..., X k G Xm(U) then 
du= Y, (-iyXi(u(Xo,--,Xi>~#X k )) 

0<i<k 

+ Y (-i) i+j u((Xi,x j ],x 0 ,...X,..,x j ,.--,x k ). 

0<i<j<k 


In particular, we have the following usefid formida for u G Or M and X, Y G 
Xm(U) : 


du)(X, Y) = X(lv(Y)) - Yw(X) - u>([X, F]) 


Proof. First we give a local definition in terms of coordinates and then 
show that the global formula ?? agree with the local formula. Let U, if be a 
local chart on M. We will first define the exterior derivative on the open set 
V = 4>(U) C R". Let £oi be local vector fields. The local representation 
of a form w is a map Uu ■ V —> L k kew ( R n ;R) and so has at some x G V has a 
derivative Du>u(x) G L( R n ,L k kew ( R n ;R)). We define 

k 

diVu(x)(£ o,"-,60 := 

i= 0 

where DiOu(x)fi(x) G L k kew ( R n ; R n ). This certainly defines a differential form 
in D k+1 (U). Let us call the right hand side of this local formula LOC. We 
wish to show that the global formula in local coordinates reduces to this local 
formula. Let us denote the first and second term of the global when expressed 
in local coordinates LI and L2. Using, our calculation 10.2 we have 

k k 

LI = Y(-mMd 0 , .... 6, .... 60) = £(-i)‘i>(w(6>, .... 6 , .... 60)00600 

2—0 2—0 
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k i— 1 

= (uu{ x), £(-1)* ^(-l)^(z)6 A [&(*) A ... A Zj A ... A 6 A ... A &(*)]> 
i=0 0 


k k 

+ (wj/(a;),^(-l) 1 A [£oOe) A ... A 6 A ... 

2—0 j=i+l 

... A £j A ... A 6=0*0]) 

k 

+ i u u( x )^ 6)0*0 A ... A 6 A ... A 6=0*0) 

i=0 

fc fc 

= E XJ ( l)' : '(6(.r)6 - 6(^)6) A 6)0*0 A ... 

i=0 i<j 


= LOC + L2. 


.... A 6 A ••• A 6 A ... A 6=0*0) 


So our global formula reduces to the local one when expressed in local coordi- 
nates. ■ 

Remark 10.2 As we indicated above, d is a local operator and so commutes 
with restrictions to open sets. In other words, if U is an open subset of M and 
djj denotes the analogous operator on the manifold U then = (da)\ u - 

This operator can thus be expressed locally. In order to save on notation we will 
use d to denote the exterior derivative on any manifold, forms of any degree 
and for the restrictions djj for any open set. It is exactly because d is a natural 
operator that this will cause no harm. 

If if = (a: 1 , . . .x n ) is a system of local coordinates on an open set U then 
all a £ m(U ) are sums of terms of the form fdx 11 A • • • A dx lk = fdx 1 where 
I — (ii, ...,ik) is a strictly increasing sequence of element from {1, ...,n}. Then 
we have 

d(/da/) = df A da/. (10-4) 


written out this is 

Lemma 10.1 Given any smooth map f : M — > N we have that d is natural 
with respect to the pull back: 


f*{drj) = d(f* V ) 


Proposition 10.2 dd = 0 

Proof. This result is an easy but boring exercise in bookkeeping and boils 
down to the fact that for smooth ( C 1 ) functions mixed partial derivatives are 
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equal. For example, 




d 2 


dxWx * 

d 2 


- 19 ^ dx j A dx i + Y' 

^ dxidx* ^ dxi dxi 

3<l j>l 


dx-i A dx l 
dx-i A dx % 


y ( d ' 2f . 


j<i 


d 2 f 

\dxWxi dx l dxl 


dx j A dec* = 0. 


Now the general result follows by using 10.3. 


Definition 10.4 A smooth differential form a is called closed if da = 0 and 
exact if a = d/3 for some differential form (3. 


Corollary 10.1 Every exact form is closed. 


The converse is not true in general and the extend to which it fails is a 
topological property of the manifold. This is the point of the De Rham coho- 
mology to be studied in detail in chapter 15. Here we just give the following 
basic definition: 


Definition 10.5 Since the exterior derivative operator is a graded map of de- 
gree one with d 2 = 0 we have, for each i, the de Rham cohomology group 
(actually vector spaces) given by 


t keijd : W(M) n <+1 (M)) 

1 ’ Im(d : — > 0*(M)) ' 


(10.5) 


In other words, we look at closed forms (forms a for which da = 0) and identify 
any two whose difference is an exact form ( a form which is the exterior derivative 
of some other form). 


10.3 Maxwell’s equations. 


Recall the electromagnetic field tensor 


0 

E x 

Ey 

E z 

—E x 

0 

-B z 

By 

— Ey 

B z 

0 

-B 

~E Z 

— By 

B x 

0 


Let us work in units where c = 1. Since this matrix is skew symmetric we can 
form a 2-form called the electromagnetic field 2-form: 

F = 1 y F^dx* A dx u = y F^dx^ A dx v . 

11, V [1<V 
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Let write E = E x dx + E y dy + E z dz and B = B x dy Adz + B y dz Adx+ B z dx Ady. 
One can check that we now have 


F = E A dt — B. 


Now we know that F comes from a potential A = A u dx u . In fact, we have 
cL4 = d{A v dx v ) = 

fl<lA 

= Yj F/ii/dx^ A dx" = F. 

Thus we automatically have dF = ddA = 0. Now what does dF = 0 translate 
into in terms of the E and B1 We compute: 


- -^—A^dx^ A dx" 
ox v 


dF = d(E A dt — B) = dE A dt — dB 
= d ( E x dx + Eydy + E z dz) A dt 


-( 


dB x dB v 


dx 


dy 


OIL 

dz 


) 


,dE z dEy ,8E X dE z ., , / 

L ( iy - s^ )dy '' dz+( ^- - 


dEy 8E x 

dy 


dx 


)dy A dx 


A dt 


dB dB x 

+ 7n Adt ~ , lSr 


dB„ dB. 


dy 


dz 


)dx Ady A dz. 


From this we conclude that 


div (B) = 0 
curl(E) + = 0 

which is Maxwell’s first two equations. Thus Maxwell’s first two equations end 
up being equivalent to just the single equation 


dF = 0 


which was true just from the fact that dd = 0! 

As for the second pair of Maxwell’s equations, they too combine to give a 
single equation 

* F = J. 

Here J is the differential form constructed from the 4-current j = (p,j) intro- 
duced in section 26.4.8 by letting (jo- ji ; j'lO.i) = (p, — j) and then J = j^dx^. 
The * refers to the Hodge star operator on Minkowski space. The star operator 
on a general semi-Riemannian manifold will be studied later but we can give a 
formula for this special case. 



10.4. LIE DERIVATIVE, INTERIOR PRODUCT AND EXTERIOR DERIVATIVE. 161 


Definition 10.6 Define e{fi) to be entries of the diagonal matrix A = diag( 1, —1, —1, — 1). 
Let * be defined on fi fc (R 4 ) by letting *(dx ll A- ■ ■ Adx lk ) = ±e(ji)e(j 2 ) • • • e(jk)dxA A 
• • • A dx in ~ k where dx 11 A • • • A dx lk A dx Jl A • • • A dx 4 n ~ k = ±dx° A dx 1 A dx 2 A dx 3 . 

( Choose the sign to that makes the last equation true and then the first is true 
by definition). Extend * linearly to a map il fe (K 4 ) — » f2 4_fc (R 4 ). 

Exercise 10.1 Show that * o * acts on O fc (K 4 ) by (— l) fe ( 4_fe )+ 1 . How would * 
be different 

Exercise 10.2 Show that *d * F = J is equivalent to Maxwell's second two 
equations 


~ (Jr j ~ 

curl(B) = —+j 
div(E) = p. 

10.4 Lie derivative, interior product and exte- 
rior derivative. 

The Lie derivative acts on differential forms since the latter are, from one view- 
point, tensors. When we apply the Lie derivative to a differential form we get 
a differential form so we should think about the Lie derivative in the context of 
differential forms. 

Lemma 10.2 For any X £ X(M) and any f £ we have Cxdf = dCxf- 

Proof. For a function / we compute as 

(C X <V)(Y) 

= (±(Fl?ydf){Y)=j t df(TFl?-Y) 

= ±Y((Fl?rf) = Y(j t (Fl?y f ) 

= Y(Cxf) = d(C x f){Y) 

where Y £ X(M) is arbitrary. ■ 

Exercise 10.3 Show that Cx{a A ff) = Cxa. A (3 + a A CxP- 

We now have two ways to differentiate sections in f l(M). Once again we 
write !!m instead of f l(U) or O(M) since every thing works equally well in 
either case. In other words we are thinking of the presheaf Dm '■ U i— > f 2(17). 
First, there is the Lie derivative which turns out to be a graded derivation of 
degree zero; 

Cx '■ — > 


( 10 . 6 ) 
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Second, there is the exterior derivative that we just introduced which is a 
graded derivation of degree 1. In order to relate the two operations we need a 
third map which, like the Lie derivative, is taken with respect to a given field 
X G r (U;TM). This map is a degree —1 graded derivation and is defined by 

lxu(X 1 ,...,X i _ 1 )=u(X,X 1 ,...,X i _ 1 ) (10.7) 

where we view to G IV M as a skew-symmetric multi-linear map from Xm x • • • x 
Xm to C^- We could also define lx as that unique operator that satisfies 

ixO = 0(X) for 6 G Il\ 4 and X G Xm 

Lx{ot A (3) = (ixot) A /3 + (— l) fc A a A (ixP) for a G O k M . 

In other word, lx is the graded derivation of Dm of degree —1 determined by 
the above formulas. 

In any case, we will call this operator the interior product or contraction 
operator. 

Notation 10.1 Other notations for lxoj include X_ilo = ( X,oj ). These nota- 
tions make the following theorem look more natural: 

Theorem 10.2 The Lie derivative is a derivation with respect to the pairing 
( X , lo) . That is 

Cx {X, u>) = (CxX,uj) + (X,C x w) 
or 

Cx{X_iu>) = (CxX )_ iuj + Xj(£.yw) 

Using the “lx” notation: Cx{rxu) = lc x xu + ixC-x^ (not as pretty). 

Proof. Exercise. ■ 

Now we can relate the Lie derivative, the exterior derivative and the con- 
traction operator. 

Theorem 10.3 Let X G Xm- Then we have Cartan’s homotopy formula; 

Cx — d o Lx A Lx ° d ( 10 . 8 ) 

Proof. One can check that both sides define derivations and so we just have 
to check that they agree on functions and exact 1-forms. On functions we have 
L X f = 0 and Lxdf = X f = C x f so formula holds. On differentials of functions 
we have 

(< d o lx + ix ° d)df = (do i x )df = dC x f = C x df 

where we have used lemma 10.2 in the last step. ■ 

As a corollary can now extend lemma 10.2: 


Corollary 10.2 d o Cx = Cx ° d 
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Proof. 


dC\a = d(di x + i'xd)(a) 
= dLxda = dixda + i\dda = Cx ° d 


Corollary 10.3 We have the following formulas: 

1) b [x,Y] = £x ° Ly + ly ° C,x 

2) Cfxu = f tdxw + df A ixw for all to £ f 2(M). 

Proof. Exercise. ■ 


10.5 Time Dependent Fields (Part II) 

If we have a time parameterized family of p- forms at on M such that at(x) := 
aft, x) is jointly smooth in t and x, i.e. a time dependent p-form, then we can 
view it as a p-form on the manifold R x M. Of course, aft , x) might only be 
defined on some open neighborhood of {0} x M ClxM but we shall assume 
that aft, x) is defined on all of R x M. This is only for simplicity in notation 
and does not effect the results in any essential way. 


10.6 Vector valued and algebra valued forms. 

Given vector spaces V and W, one can also define the space T°(V; W) = Lf. kew 
(V; W) of all (bounded) skew-symmetric maps 


Vx Vx ••• x V->W 
^ y ^ 
k— times 

We define the wedge product using the same formula as before except that we 
use the tensor product so that a A (3 € T k skew (V; W ® W) : 


(w A ri)(v 1 ,v 2 , ..,v r ,v r+ i,v r+2 , ..,v r +s ) 

= X! s gn(<^)w(v , »a 2 , - , V) ® I i( V +1 > V +2 > V+J 

r,s — shuffles a 

Globalizing this algebra as usual we get a vector bundle W (S> (f\ k T*M) which in 
turn gives rise to a space of sections Ll k (M,W) (and a presheaf U i— > Ll k (U,W)) 
and exterior product fl k (U,W) x Ll l (U,W) — > Ll k+l (U,W (g> W). The space 
f l k {U, W) is a module over Cf^fU). We still have pullback and a natural exterior 
derivative 


d : Ll k (U, W) -► fi fc+1 (C/,W) 
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defined by the formula 

doj(X o, X k ) 

= Y {-iyx l {u{x 0 ,...7x. l ,...,x k )) 

1 <i<k 

+ Y (-i) <+ Mx 0 , .... [Xi, Xj], ..., x k ) 

where now lu(X o, Xi, ..., X k ) is a W-valued function so we take 

Xi(u(X o, Xi , ..., X k ))(p) 

= Du(X 0 ,...yXi,...,X k )\ .Xi(p) 

= d(u(X 0 ,...7x i ,...,X k ))(X i (p)) 

which is an element of W under the usual identification of W with any of its 
tangent spaces. 

To give a local formula valid for finite dimensional M, we let fi, f n be 
basis of W and (a: 1 , x n ) local coordinates defined on U. For oj = J2 a p fj®dx 
we have 

du = d(fj 8 Y^ a T jdx 1 ) 

= Y(fj ® da i • A dx 1 ). 

The elements fj 8 dx p form a basis for the vector space W 8 (/\ k T*M ) for every 
p € U. 

Now if W happens to be an algebra then the algebra product WxW-*W 
is bilinear and so gives rise to a linear map to: W 8 W — > W. We compose 

m , 

the exterior product with this map to get a wedge product A : fi (t/,W) x 

n l {u,w) -► n k +\u,w) 

771 

(u A r/)(vi,v 2 , ..,v r ,v r+1 ,v r+ 2 , ~,i V+s) 

= Y s g n ( CT ) TO At,.) ®^(A7 r+1 ,AT r+2 >-->Ar r+ j) 

r,s— shuffles cr 

= Y sgn(<r)w( v , u ff2> „, v) ■ >)( V+u V+ 2 > ^rt.) 

r,s— shuffles cr 

A particularly important case is when W is a Lie algebra g with bracket 

m 

Then we write the resulting product A as [., ,] A or just [., .] when there is no risk 
of confusion. Thus if u,r] £ 0 1 (?7, g) are Lie algebra valued 1-forms then 

W>v] P0 = [w(X),??(y)] + [tj(X),uj(Y)]. 

In particular, \{u>,ui\{X,Y) = (y(X), w(Y)] which might not be zero in general! 


P 
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10.7 Global Orientation 

A rank n vector bundle E — > M is called oriented if every fiber E p is given 
a smooth choice of orientation. There are several equivalent ways to make a 
rigorous definition: 

1. A vector bundle is orientable iff has an atlas of bundle charts such that 
the corresponding transition maps take values in GL + (n,R) the group of 
positive determinant matrices. If the vector bundle is orientable then this 
divides the set of all bundle charts into two classes. Two bundle charts are 
in the same orientation class the transition map takes values in GL + (n, R). 

If the bundle is not orientable there is only one class. 

2. If there is a smooth global section s on the bundle /\ n E — > M then we say 
that this determines an orientation on E. A frame (/i, ..., /„) of fiber E p 
is positively oriented with respect to s if and only if /i A ... A /„ = as(p ) 
for a positive real number a > 0. 

3. If there is a smooth global section u> on the bundle f\ n E* = L^ lt (E) — > M 
then we say that this determines an orientation on E. A frame (/i, ..., /„) 

of fiber E p is positively oriented with respect to w if and only > 

0. 

Exercise 10.4 Show that each of these three approaches is equivalent. 

Now let M be an n-dimensional manifold. Let U be some open subset of 
M which may be all of M. Consider a top form, i.e. an n-form m £ SYh(U) 
where n = dim(M) and assume that m is never zero on U. In this case we will 
say that w is nonzero or that w is a volume form. Every other top form p is 
of the form p = fw for some smooth function /. This latter fact follows easily 
from dim(/\' ! T p M) = 1 for all p. If ip : U — > U is a diffeomorphism then we 
must have that < p*vo = 5m for some 5 £ C°°(U) which we will call the Jacobian 
determinate of <p with respect to the volume element m: 

p*m = J^(p)m 

Proposition 10.3 The sign of is independent of the choice of volume 

form m. 

Proof. Let m' G Ll m(U). We have 


'UJ — CLTaJ 

for some function a which is never zero on U. We have 

J(p)m = (p*m) = (a o ip)(p*m') 

= (a o p)J^i(ip)m = m 


a 
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and since > 0 and zu is nonzero the conclusion follows. ■ 

Let us consider a very important special case of this: Suppose that ip : U — > 
U is a diffeomorphism and U C R". Then letting zug = du 1 A • • • A du n we have 


p*zug(x) = p*du 1 A p* ■ ■ ■ A (p*du n (x ) 


= det 


d{v} o p) 


du n 

d(u l o ip) 

dui~ 


du 11 A 


ME 


d{u n o ip) 


du 1 


du 1 


(a:) = Jtp(x). 


so in this case </ roo (<p) is just the usual Jacobian determinant of ip. 


Definition 10.7 A diffeomorphism ip : U — > U C R n is said to be positive or 
orientation preserving if det(Tip) > 0. 


More generally, let a nonzero top form zu be defined on U C M and let zu' be 
another defined on U' C N. Then we say that a diffeomorphism ip : U — > U' is 
orientation preserving (or positive) with respect to the pair zu, zu' if the unique 
function J TOro / such that ip*vu’ = u is strictly positive on U. 

Definition 10.8 A differentiable manifold M is said to be orientable iff there 
is an atlas of admissible charts such that for any pair of charts ip a ,U a andxpp, Up 
from the atlas with U a D Up M 0 , the transition map ipp o ip- 1 is orientation 
preserving. Such an atlas is called an orienting atlas. 


Exercise 10.5 The tangent bundle is a vector bundle. Show that this last def- 
inition agrees with our definition of an orientable vector bundle in that M is 
an orientable manifold in the current sense if and only if T M is an orientable 
vector bundle. 


Let Am be the maximal atlas for a orientable differentiable manifold M. 
Then there are two sub-atlas A and A! with A U A! = Am, A ("I A! = 0 and 
such that the transition maps for charts from A are all positive and similarly 
the transition maps of A! are all positive.. Furthermore if ip a ,U a £ A and 
ipp,Up € A! then ippoip- 1 is negative (orientation reversing). A choice of one 
these two atlases is called an orientation on M. Every orienting atlas is a 
subatlas of exactly one of A or A! ■ If such a choice is made then we say that M 
is oriented. Alternatively, we can use the following proposition to specify an 
orientation on M: 

Proposition 10.4 Letm€fl n (M) be a volume form on M , i.e. zu is a nonzero 
top form. Then zu determines an orientation by determining an (orienting) atlas 
A by the rule 

ip a , U a £ A <==> ip a is orientation preserving resp. zu,zuq 
where zug is the standard volume form on K n introduced above. 
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Exercise 10.6 Prove the last proposition and then prove that we can use an 
orienting atlas to construct a volume form on an orient-able manifold that gives 
the same orientation as the orienting atlas. 

We now construct a two fold covering manifold Or (M) for any finite dimen- 
sional manifold called the orientation cover. The orientation cover will itself 
always be orientable. Consider the vector bundle /\" T*AI and remove the zero 
section to obtain 


T*M) X := T*M — {zero section} 

Define an equivalence relation on (f\ n T* M) x by declaring v\ ~ iff v\ and V 2 
are in the same fiber and if v\ = av 2 with a > 0. The space of equivalence classes 
is denoted Or(M). There is a unique map 7r or making the following diagram 
commute: 

Or(M) -» (A n T*M) x 

I 

r^Or A. 

M 

Now Or(M) — » Jlf is a covering space with the quotient topology and in fact is 
a differentiable manifold. 


10.8 Orientation of manifolds with boundary 

Recall that a half space chart if a for a manifold with boundary M is a bijection 
(actually diffeomorphism) of an open subset U a of M onto an open subset of 
R? _ . A C r half space atlas is a collection i[> a , U a of such charts such that for 
any two; ip a , U a and ip/hUp, the map Aa ° V’^ 1 is a C r diffeomorphism on its 
natural domain (if non-empty) . Note: “Diffeomorphism” means in the extended 
the sense of a being homeomorphism and such that both ip a o ijj^ 1 :: R" _ — > R" 
and its inverse are C r in the sense of definition 2.5. 

Let us consider the case of finite dimensional manifolds. Then letting R” = 
R™ and A = pr\ : R™ — > R we have the half space R)( _ = R” 1<0 . The funny 
choice of sign is to make R^ _ = R™i<o ra tl ier than R" 1>0 - The reason we do 
this is to be able to get the right induced orientation on dM without introducing 
a minus sign into our Stoke’s formula proved below. The reader may wish to 
re-read remark 2.6 at this time. 

Now, imitating our previous definition we define an oriented (or orienting) 
atlas for a finite dimensional manifold with boundary to be an atlas with ranges 
all in R™i<o anc ^ suc h that the overlap maps if a ° :: R"i< 0 — » R"i< 0 are 

orientation preserving. A manifold with boundary with a choice of (maximal) 
oriented atlas is called an oriented manifold with boundary. If there exists 
an orienting atlas for M then we say that M is orientable just as the case of 
a manifold without boundary. 
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Now if A = {{ipa, U a )} a ^A is an orienting atlas for M as above with domains 
in K"i< 0 then the induced atlas {{ip a \ u ndM > U a r\dM)} ae A is an orienting atlas 
for the manifold d M and the resulting choice of orientation is called the induced 
orientation on OM . If M is oriented we will always assume that dM is given 
this induced orientation. 

Definition 10.9 A basis fi, fi, ..., f n for the tangent space at a point p on an 
oriented manifold (with or without boundary) is called positive if whenever 
if a = (a; 1 , x n ) is an oriented chart, on a neighborhood of p then ( dx 1 A ... A 
dx n )(fl,f 2 ,-,fn) > 0. 

Definition 10.10 A vector v in T p M for a point p on the boundary dM is 
called outward pointing if T p ip a ■ v £ is outward pointing in the sense that 
\(T p ijj a ■ v) > 0. 


Since we have chosen A = pr\ and hence R" = R™i< 0 for our definition in 
choosing the orientation on the boundary we have that in this case v is outward 
pointing iff T p ip a ■ v € R™i< 0 - 

Definition 10.11 A nice chart on a smooth manifold (possibly with boundary) 
is a chart ip a ,U a where ip a is a diffeomorphism onto R"i< 0 if U a C\dM ^ 0 and 
a diffeomorphism onto the interior R”i <0 if U a D dM = 0. 

Lemma 10.3 Every (oriented) smooth manifold has an (oriented) atlas con- 
sisting of nice charts. 

Proof. If ip a , U a is an oriented chart with range in the interior of the left 
half space R" 1<0 then we can find a ball B inside ip a {U a ) in R™ 1<0 and then 
we form a new chart on ip~ l (B) with range B. But a ball is diffeomorphic to 
]R™ 1<0 . So composing with such a diffeomorphism we obtain the nice chart. If 
ip a , U a is an oriented chart with range meeting the boundary of the left half 
space R™i< 0 then we can find a half ball _B_ in R"i <0 with center on R"i_ 0 . 
Reduce the chart domain as before to have range equal to this half ball. But 
every half ball is diffeomorphic to the half space R”i <0 so we can proceed by 
composition as before. ■ 

Proposition 10.5 If ip a = (x 1 , ..., x n ) is an oriented chart, on a neighborhood 
of p on the boundary of an oriented manifold with boundary then the vectors 
g§ 2 i •••, gfs: form a positive basis for T p dM with respect, to the induced orien- 
tation on dM. More generally, if fi is outward pointing and fi, fi, ■■■, fn is 
positive on M at. p, then fi, ■■■, f n will be positive for dM at. p. 


10.9 Integration of Differential Forms. 

Let M be a smooth n-manifold possibly with boundary dM and assume that M 
is oriented and that dM has the induced orientation. From our discussion on 
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orientation of manifolds with boundary and by general principles it should be 
clear that we may assume that all the charts in our orienting atlas have range in 
the left half space R" 1<0 . If OM = 0 then the ranges will just be in the interior 

Every fc-form a ^ on an open subset U of R"i< 0 is of the form a ^ = 
a(.)da 1 A • • • A du k for some smooth function a(.) G C°°(U). If a(.) has compact 
support supp(a) then we say that a ^ has compact support supp(cd fc )) := 
supp(a). In general, we have the following 


Definition 10.12 A the support of a differential form a G Cl(M) is the closure 
of the set {p G M : a(p) 0} and is denoted by supp(a). The set of all k-forms 
cd fc ) which have compact support contained in U C M is denoted by fl k (U). 


Let us return to the case of a form a ^ on an open subset U of R fc . If a <k> 
has compact support in U we may define the integral j fJ a by 


f a ^ = f afujdu 1 A • • • A du k 

Ju Ju 

:= f a(u) \du l ■ ■ ■ du k \ 

Ju 


where this latter integral is the Lebesgue integral of a(u). We have written 
| du 1 ■ ■ ■ du k | instead of du 1 ■ ■ ■ du k to emphasize that the order of the du l does 
not matter as it does for du 1 A • • • A du k . 

Now consider an oriented n— dimensional manifold M and let a G klf f . If 
a has compact support inside U a for some chart ij) a ,Ua compatible with the 
orientation then if- 1 : if a (U a ) — » U a and (if- 1 )* a has compact support in 
if a (U a )c R”i< 0 - We define 


lil>cAu a 


(C 1 )* 


The standard change of variables formula show that this definition is indepen- 
dent of the oriented chart chosen. Now if a G Cl n (M) does not have support 
contained in some chart domain then we choose a locally finite cover of M by 
oriented charts ifi,Ui and a smooth partition of unity pi , U{ , supp(pi) C U t . 
Then we define 


a := 


E 




wr 1 )* (/»<«) 


Proposition 10.6 The above definition is independent of the choice of the 
charts ifi, Ui and smooth partition of unity pi,Ui. 
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Proof. Let (pi, Vi, and be another such choice. Then we have 


/«:=£/ (v-rYM 

= E/ (4>i 1 )*(pi'52pj a ) 

i UiiUi) v 

ee / (' t i’i i )*(pipj a ) 


-EE 


4>j(Uir\Uj 


{<t>j ^(piPjO!) 


= E / (07 1 )*(? j a) 


10.10 Stokes’ Theorem 

Let us start with a couple special cases . 

Case 10.1 (1) Let u)j = fdu 1 A • • • A dul A • • • A du n be a smooth n — 1 form 
with compact support contained in the interior o/ R™ i <0 where the hat symbol 
over the du ? means this j-th factor is omitted. All n — 1 forms on R”i< 0 are 
sums of forms of this type. Then we have 

diyfdu 1 A • • • A dvf A • • • A dit") 

1 <0 

( df A du 1 A • • • A dui A • • • A dw 11 ) 

1 <0 

^-rdu k A du 1 A • • • A dui A • • • A dw n ) 

<9u fc ’ 

1 <0 fc 

(— l)- 7-1 ^Cdu 1 A • • • A du n = [ (-1 y- 1 ^-du 1 ■ ■ ■ du n 


by the fundamental theorem of calculus and the fact that f has compact 
support.. 



Case 10.2 (2) Let u>j = /dii 1 A • • • A duf A • • • A du n be a smooth n — 1 /orm 
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with compact support, meeting <9R"i <0 = 


u 1 — 0 


= ox: 


pn— 1 


then 


dcjj = 


d(fdu 1 A • • • A dui A • • • A du n ) 


uko 


u 1 <0 


= 0 if j ^ 1 and if j = 1 we have / dwi = 

n!<0 

=Lj- iri (L^ dul ) du * A ''' Adu " 

= f f(0,u 2 ,...,u n )du 2 ---du n 

J R ”- 1 

= f /(O, w 2 , u n )du 2 A • • • A du n = f lj i 
IVow since clearly f dRn Wj = 0 if j ^ 1 or i/wj /ias support, that, doesn’t, meet. 

u 1 <0 

9M"i <0 we see i/iai in any case L„ du>j = f aRn u)j. Now as we said all 

~ u 1 <0 u 1 <0 

n — 1 forms on R”i< 0 are sums of forms of this type and so summing such we 
have for any smooth n— 1 form on R™i <0 . 




w. 


Now we define integration on a manifold (possibly with boundary). Let 
AIm = (f> a ,Ua)aGA be an oriented atlas for a smooth orientable n-manifold 
M consisting of nice charts so either ip a : U a = R” or if a : U a = R"i< 0 - 
Now let {p a } be a smooth partition of unity subordinate to {U a }. Notice that 
{p a \u n9M } is a partition of unity for the cover {U a C I d M} of OM . Then for 
to £ f we have that 


>M 


div= y2 d(p a u>) = V' / d(p a Lu) 

Ju a a a Ju a 

= E f C rf (p« w ) = [ d (CPaW) 

a J MU a ) 

= J2[ [ (TpaPctU) 

„ Jdib„(U n ) 


= £ 


ac/ a 


= / w 

JdM 


so we have proved 


Theorem 10.4 (Stokes’ Theorem) Let. M be an oriented manifold with bound- 
ary ( possibly empty) and give dM the induced orientation. Then for any 
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co £ fl n 1 (M) we have 



10.11 Vector Bundle Valued Forms. 

It will occur in several contexts to have on hand the notion of a differential form 
with values in a vector bundle. 

Definition 10.13 Let £ = (F — > M) be a smooth vector bundle. A dif- 
ferential p—form with values in f (or values in E) is a smooth section of the 
bundle E ® /\ P T*AI. These are denoted by Ll p (f) (or informally O p (E) if the 
context is sufficient to avoid confusion with T(M,A P E)). 

In order to get a grip on the meaning of the bundle let use exhibit transition 
functions. We know that for a vector bundle knowing the transition functions is 
tantamount to knowing how local expressions with respect to a frame transform 
as we change frame(?did I explain this?). A frame for E ® A P T*M is given by 
combining a local frame for E with a local frame for A P TM. Of course we must 
choose an common refinement of the VB-charts to do this but this is obviously 
no problem. Let (ei,...,efc) frame defined on U which we may as well take to 
also be a chart domain for the manifold M. Then any local section of Ll p (f) 
defined on U has the form 


a = ® dx 1 


for some smooth functions al = a ? , defined in U. Then for a new local set 

/ *1 . . . Ip 

up with frames (/i, ■■■, fk) and dy 1 = dy 11 A • • • A dy lp (■ i\ < ... < i p ) then 

o = ^2 ® d y T 


we get the transformation law 





dx'd 

dy 7 


and where C\ is defined by f s Cf = ej. 


Exercise 10.7 Derive the above transformation law. 


Solution 10.1 V a : N.j ® dx 1 = V a : (f s C( ® ^ ydy 1 etc. 

A more elegant way of describing the transition functions is just to recall 
that anytime w have two vector bundles over the same base space and respective 
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typical fibers V and W then the respective transition functions g a p and h a p ( 
on a common cover) combine to give g a p ® h a p where for a given x £ U a p 

g a p(ft <S> h a p(x) : V ® W — > V ® W 
g a p{x) ® h a p(x).(\/, w) = g a p(x)\i ® h a p(x) w. 

At any rate, these transformation laws fade into the background since if all out 
expressions are manifestly invariant (or invariantly defined in the first place) 
then we don’t have to bring them up. A more important thing to do is to get 
used to calculating. 

If the vector bundle is actually an algebra bundle then (naming the bundle 
A — » M now for “algebra”) we may turn A® AT* M := )Cp=o ® A P T*M into 
an algebra bundle by defining 

(fi ® g 1 ) A (i> 2 ® g 2 ) := v\v 2 ® g 1 A g 2 

and then extending linearly: 

(a l jVi ® g j ) A (6fi>fc ® g l ) := ViVj ® g J A g l 

From this the sections Q(M,A) = T(M,A ® A T*M) become an algebra over 
the ring of smooth functions. For us the most important example is where 
A = End(E). Locally, say on U, sections <ji and a -2 of fl(M, End(.E)) take the 
form <ti = Ai <S> a 1 and (72=5^® /3 l where A,; and Bi are maps U — > End(E). 
Thus for each x £ U, the A, and B, evaluate to give A i (x),B i (x) £ End(Fi a ;). 
The multiplication is then 

(At ® a’) A ( Bj ® /3 j ) = A,B :i ® of A ft 
where the AiBj : U — >■ End(Fi) are local sections given by composition: 

AiBj : x i— > Ai(x) o Bj(x). 

Exercise 10.8 Show that Q(M, End(E)) acts on 0(M,E) making fl(M,E ) a 
bundle of modules over the bundle of algebras Fl(M, End(E)). 

If this seems all to abstract to the newcomer perhaps it would help to think 
of things this way: We have a cover of a manifold M by open sets {U a } which 
simultaneously trivialize both E and TM. Then these give also trivializations 
on these open sets of the bundles Hom(£, E) and ATM. Associated with each 
is a frame field for E M say (ei, e^) which allows us to associate with each 
section a £ f l p (M, E) a A:— tuple of p— forms au = (a^) for each U. Similarly, a 
section A £ f l q (M, End(E)) is equivalent to assigning to each open set U £ {U a } 
a matrix of q— forms A jj. The algebra structure on Cl(M, End(-E)) is then just 
matrix multiplication were the entries are multiplies using the wedge product 
Ajj A Bjj where 

(Au A Bu)) = A\ A B k 
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The module structure is given locally by air i— > Ajj A ajj- Where did the bundle 
go? The global topology is now encoded in the transformation laws which tell 
us what the same section look like when we change to a new from on an overlap 
U a fl Up! In this sense the bundle is a combinatorial recipe for pasting together 
local objects. 



Chapter 11 

Distributions and 
Frobenius’ Theorem 

11.1 Definitions 

In this section we take M to be a C°° manifold modelled on a Banach space 
M. Roughly speaking, smooth distribution is an assignment Aof a subspace 
A p C T p M to each p £ M such that for each p £ M there is a family of 
smooth vector fields X \ , .... X \ defined on some neighborhood U p of p and such 
that A x = span{Xi(x), ...,Xk(x)} for each x € U p . We call the distribution 
regular iff we can always choose the vector fields to be linearly independent 
on each tangent space T X M for x £ U p and each U p . It follows that in this 
case k is locally constant. For a regular distribution k is called the rank of 
the distribution. A rank k regular distribution is the same think as a rank k 
subbundle of the tangent bundle. We can also consider regular distributions of 
infinite rank by simply defining such to be a subbundle of the tangent bundle. 

Definition 11.1 A (smooth) regular distribution on a manifold M is a smooth 
vector subbundle of the tangent bundle TM . 

11.2 Integrability of Regular Distributions 

By definition a regular distribution A is just another name for a subbundle 
A C TM of the tangent bundle and we write A p C T p M for the fiber of the 
subbundle at p. So what we have is a smooth assignment of a subspace A p 
at every point. The subbundle definition guarantees that the spaces A p all 
have the same dimension (if finite) in each connected component of M . This 
dimension is called the rank of the distribution. There is a more general notion 
of distribution which we call a singular distribution which is defined in the 
same way except for the requirement of constancy of dimension. We shall study 
singular distributions later. 
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Definition 11.2 Let X locally defined vector field. We say that X lies in the 
distribution A if X(p) £ A p for each p in the domain of X. In this case, we 
write X £ A (a slight abuse of notation). 

Note that in the case of a regular distribution we can say that for X to lie 
in the distribution A means that X takes values in the subbundle A C TM. 

Definition 11.3 We say that a locally defined differential j-form to vanishes 
on A if for every choice of vector fields X\,...,Xj defined on the domain of lv 
that lie in A the function oj(X i, .. ,,Xj ) is identically zero. 

For a regular distribution A consider the following two conditions. 

Frol For every pair of locally defined vector fields X and Y with common 
domain that lie in the distribution A the bracket [X, Y] also lies in the 
distribution. 

Fro2 For each locally defined smooth 1-form to that vanishes on A the 2-form 
cko also vanishes on A. 

Lemma 11.1 Conditions (1) and (2) above are equivalent. 

Proof. The proof that these two conditions are equivalent follows easily 
from the formula 


du>(X, Y) = X{co(Y)) - Yw(X) - u([X, A]). 

Suppose that (1) holds. If to vanishes on A and X,Y lie in A then the above 
formula becomes 

div(X,Y) = -u([X,Y}) 

which shows that du vanishes on A since [X, Y] € A by condition (1). Con- 
versely, suppose that (2) holds and that X,Y £ A. Then du(X, Y) = — oj([X, Y]) 
again and a local argument using the Hahn-Banach theorem shows that [ X , Y] = 
0 . ■ 

Definition 11.4 If either of the two equivalent conditions introduced above 
holds for a distribution A then we say that A is involutive. 

Exercise 11.1 Suppose that X is a family of locally defined vector fields of M 
such that for each p £ M and each local section X of the subbundle A defined 
in a neighborhood of p, there is a finite set of local fields {X;} C X such that 
X = Y) d l Xi on some possible smaller neighborhood of p. Show that if X is 
closed under bracketing then A is involutive. 

There is a very natural way for distributions to arise. For instance, con- 
sider the punctured 3-space M = M 3 — {0}. The level sets of the function 
e : (x,y,x) i— > x 2 + y 2 + x 2 are spheres whose union is all of R 3 — {0}. Now 
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define a distribution by the rule that A p is the tangent space at p to the sphere 
containing p. Dually, we can define this distribution to be the given by the rule 

A p = {u £ T p M : de{v) = 0}. 

The main point is that each p contains a submanifold S such that A x = T X S 
for all x £ S D U for some sufficiently small open set U C M. On the other 
hand, not all distributions arise in this way. 

Definition 11.5 A distribution A on M is called integrable at p £ M there 
is a submanifold S p containing p such that A x = T X S P for all x £ S. (Warning: 
S p is locally closed but not necessarily a closed subset and may only be defined 
very near p.) We call such submanifold a local integral submanifold of A. 

Definition 11.6 A regular distribution A on M is called (completely) inte- 
grable if for every p £ M there is a (local) integral submanifold of A containing 

P- 


If one considers a distribution on a finite dimensional manifold there is a 
nice picture of the structure of an integrable distribution. Our analysis will 
eventually allow us to see that a regular distribution A of rank k on an n- 
manifold M is (completely) integrable if and only if there is a cover of M by 
charts ipa, U a such that if ip a — (j/ 1 , — , y n ) then for each p £ U a the submanifold 
S atP defined by S a , p := {x £ U a : y l (x ) = y l (p) for k + 1 < i < n} has 

A x = T x S a , p for all x £ S p . 

Some authors use this as the definition of integrable distribution but this def- 
inition would be inconvenient to generalize to the infinite dimensional case. A 
main goal of this section is to prove the theorem of Frobenius which says that 
a regular distribution is integrable if and only if it is involutive. 


11.3 The local version Frobenius’ theorem 

Here we study regular distributions; also known as tangent subbundles. The 
presentation draws heavily on that given in [LI]. Since in the regular case a 
distribution is a subbundle of the tangent bundle it will be useful to consider 
such subbundle a little more carefully. Recall that if E — > M is a subbundle of 
TM then E C TM and there is an atlas of adapted VB-charts for TM\ that 
is, charts <j> : -»(7xM=f/xExF where E x F is a fixed splitting 

of M. Thus M is modelled on the split space E x F = M. Now for all local 
questions we may assume that in fact the tangent bundle is a trivial bundle of 
the form (t/i x U 2 ) x (E x F) where U\ x U 2 C E x F. It is easy to see that our 
subbundle must now consist of a choice of subspace Ei(a;, y) of (E x F) for every 
(x,y) £ U\ x U 2 . In fact, the inverse of our trivialization gives a map 

: (Ui x U 2 ) x(ExF)^ {U\ x U 2 ) x (E x F) 
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such that the image under cf 1 of {(x, y)} x E x {0} is exactly {(a;, y)}x Ei(x, y). 
The map fr 1 must have the form 

<t>((x,y),v,w) = ((a:,?/), f [x , y )(v,w), g {x . y) (v,w)) 

for where f( x ,y) : E x F — > E and gr x , y ) '■ E x F — > E are linear maps depending 
smoothly on (. x,y ). Furthermore, for all (x,y) the map f(x, y ) takes E x {0} 
isomorphically onto {(x,y)} x Ei (x,y). Now the composition 

k : (Ui x U 2 ) x E c — > {U\ x U 2 ) x (E x {0}) (U\ x U 2 ) x (E x F) 

maps {(x,y)} x E isomorphically onto {(x, y)} x Ei(x, y) and must have form 
k(x, y , v) = (x, y , A(x, y) ■ v, £(x, y) ■ v) 

for some smooth maps ( x , y) 1 — > A(x, y) £ L( E, E) and (x, y) 1 — > £(x, y) £ L( E, F). 
By a suitable “rotation” of the space E x F for each (x, y) we may assume that 
\x,y) = idE • Now for hxecl v £ E the map X v : ( x,y ) 1 — > {x,y,v,£i x ^ y \v) is 
(a local representation of) a vector field with values in the subbundle E. The 
principal part is X v (x,y) = ( 1 >,£( x , y ) • v). 

Now £(x,y) £ L( E, F) and so D£(x,y) £ L( E x F, L(E, F)). In general for a 
smooth family of linear maps A u and a smooth map v : (x, y) 1 — > v(x, y) we have 

D( A u ■ v)(w) = DA u (w) ■ v + A u ■ ( Dv)(w ) 

and so in the case at hand 

D(£(x, y) ■ v)(w 1 ,w 2 ) 

= ( D£(x , y)(w 1 ,w 2 )) ■ v + £(x, y) ■ (Dv)(w 1 ,w 2 ). 

For any two choices of smooth maps V\ and v 2 as above we have 


[X^i (DX V2 ^( x ) y)X Vl (x, y) (-OX„ 1 ) (x, 2 /) 

= {{Dv 2 )(vi,£( Xt y)Vi) - (Dv i){v 2 ,£^ Xt y)V 2 ),De(x,y)(v 1 ,£^ Xt y)Vi) ■ v 2 
+ £{x,y) ■ {Dv 2 ){vi,£( Xt y)Vi) - D£(x,y)(v 2 ,£ {XtV )V 2 ) ■ Vi 
~ £{X,V) ■ (Dwi)(U2,^( x , y )U 2 )) 

= (£,D£(x, 2 /)(ni,^( Xi y)Ui) • v 2 - D£(x,y)(v 2 ,£( x>v )V 2 ) ■ v\ + £{x,y) ■ £). 

where £ = {Dv 2 )(vi, £( x , y )V 1 ) - (Dvi)(v 2> £( XtV )V 2 ). Thus [X Vl , X„ 2 ] is in the 
subbundle iff 


D£(x,y)(vi,£( XtV )Vi) ■ v 2 - D£(x,y){v 2 ,£^ XtV )V 2 ) ■ v\. 
We thus arrive at the following characterization of involutivity: 


Lemma 11.2 Let A be a subbundle ofTM. For every p £ M there is a tangent 
bundle chart containing T p M of the form described above so that any vector field 
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vector field taking values in the subbundle is represented as a map X t , : U\ x U 2 — »■ 
E x F of the form (x,y) 1 — > (v(x,y),£r Xty -\v(x,y)). Then A is involutive (near p) 
iff for any two smooth maps v\ : U\ x U 2 — > E and v 2 : U\ x U 2 — > E we have 

D£(x, y)(vi,£( XtV )Vi) ■ v 2 - D£(x,y)(v 2 ,£( XtV )V 2 ) ■ vi. 

Theorem 11.1 A regular distribution A on M is integrable if and only if it is 
involutive. 

Proof. First suppose that A is integrable. Let X and Y be local vector fields 
that lie in A. Pick a point x in the common domain of X and Y. Our choice 
of x being arbitrary we just need to show that [X, Y](a;) G A. Let S C M be 
a local integral submanifold of A containing the point x. The restrictions X\ s 
and yjcy are related to X and Y by an inclusion map and so by the result on 
related vector fields we have that [X| s , Yj g ] = [X, Y]| s on some neighborhood 
of x. Since S' is a manifold and [X| s , Yj g ] a local vector field on S we see 
that [X, T]| s (x) = [X, Y](x) is tangent to S and so [A,Y]( x) € A. Suppose 
now that A is involutive. Since this is a local question we may assume that our 
tangent bundle is a trivial bundle {U\ x U 2 ) x (E x F) and by our previous lemma 
we know that for any two smooth maps V\ : U\ x U 2 — > E and v 2 : U\ x U 2 —■ > E 
we have 

D£(x,y)(vi,£ ( XtV )Vi) ■ v 2 - D£(x,y)(v 2 ,£( XtV) v 2 ) ■ v\. 

Claim 11.1 For any (xo,yo) G U\ x U 2 there exists possibly smaller open 
product U[ x U 2 C U\ x U 2 containing (xo,yo) and a unique smooth map 
a : U[ x U' 2 — > U 2 such that a(xo, y) = y for all y &U' 2 and 

D 1 a(x, y) = £(x,a(x,y)) 

for all (x,y) G U[ x U^. 

Before we prove this claim we show how the result follows from it. For any 
y G U' 2 we have the partial map a y (x ) := a(x,y) and equation ?? above reads 
Da y (x,y) = £(x,a y (x)). Now if we define the map (j> : U[ x U 2 — > U\ x U 2 
by <f(x,y) := (x,a y (x)) then using this last version of equation ?? and the 
condition a(xo ,y) = y from the claim we see that 

D 2 a(x 0 , j/o ) = Da(x 0 , .)(?/o) 

= D idrr' = id . 

U 2 

Thus the Jacobian of <j> at (xo,yo) has the block form 

id 0 
* id 

By the inverse function theorem <f> is a local diffeomorphism in a neighborhood 
of (xo,yo)- We also have that 

(D 1 f>)(x, y) • (v, w) = (v, Da y (x) ■ w ) 

= (v,£(x,a y (x)) ■ v). 
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Which is the form of elements of the subbundle but is also the form of tangents 
to the submanifolds which are the images of U[ x {y} under the diffeomorphism 
<j> for various choices of y £ U 2 . This clearly saying that the subbundle is 
integrable. 

Proof of the claim: By translation we may assume that (xo,yo) = (0,0). 
We use theorem 26.18 from appendix B. With the notation of that theorem we 
let f(t, x, y) := £(tz, y ) • 2 where y € U 2 and 2 is an element of some ball B{ 0, e) 
in E. Thus the theorem provides us with a smooth map (3 : Jo x B(0, e) x U 2 
satisfying /3(0 ,z,y) = y and 

t|/3(C 2 , y ) = £(tz, /3(t, 2 , y )) • 2 . 

We will assume that 1 £ J since we can always arrange for this by making a 
change of variables of the type t = as, z = x/a for a sufficiently small positive 
number a (we may do this at the expense of having to choose a smaller e for 
the ball f?(0, e). We claim that if we define 

a(x,y) := /3(1 ,x,y) 

then for sufficiently small \x\ we have the required result. In fact we shall show 
that 

D 2 (3(t, z, y) = t£(tz, p{t, z, y)) 
from which it follows that 

Dxa(x,y) = D 2 (3(l,x,y) = £(x,a(x,y)) 

with the correct initial conditions (recall that we translated to ( Xo, yo ))• Thus 
it remains to show that equation ?? holds. From (3) of theorem 26.18 we know 
that D 2 /3(t,z,y) satisfies the following equation for any veE: 

d d 

— D 2 f3(t, 2 , y) = t-^_£{tz, (3{t, z, y)) - v-z 

+ D 2 £(tz, p{t, z, y)) ■ D 2 (3(t, z,y) -v z 
+ £{tz,f3(t,z,y)) ■ v. 

Now we fix everything but t and define a function of one variable: 

<F(t) := D 2 p(t, z, y) ■ v — t£{tz, (3{t, 2 , y). 

Clearly, <I>(0) = 0. Now we use two fixed vectors v,z and construct the fields 
X v ix,y) = (v,£r Xt y'i ■ v) and X z (x,y ) = {z,£( XtV ) ■ 2 ). In this special case, the 
equation of lemma 11.2 becomes 


D£{x,y){v,£( Xt y)v) • 2 - D£{x,y){z,£( x , y )z) • v. 
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Now with this in mind we compute -4<f>(f) : 

^$ 0 ) = ^.(D 20 (t,z,y) ■ v - U{tz,(3(t,z,y )) 

= ^ D 2 /3(t , z,y) ■ v — t^£(tz, Pit, z , y)) - £(tz, f3(t, 2 , y)) 

= ^ D 2 /3(t , z,y) ■ v — t{Di£(tz, (3{t, z , y)) • 2 

+ Z? 2 ^(te, P(t, z, y)) • y) - f(t 2 , /?(*, 2 , y)) 

= D 2 £(tz, (3(t, 2 , y)) • {£> 2 /3(f , z,y)-v - t£(tz, f3(t, 2 , y)} • 2 (use 11.3) 

= D 2 l{tz , /3(t, 2 , y)) • <f>(f) • 2 . 

So we arrive at j|<I>(t) = D 2 £(tz, (3(t, 2 , y))- < t ) (t )-2 with initial condition <t>(0) = 0 
which implies that <f>(t) = 0. This latter identity is none other than D 2 /3(t, z, y) ■ 
v = M(tz,f3(t,z,y). m 

It will be useful to introduce the notion of a co-distribution and then explore 
the dual relationship existing between distributions and co-distributions. 

Definition 11.7 A (regular) co- distribution Q on a manifold M is a subbun- 
dle of the cotangent bundle. Thus a smooth assignment of a subspace f l x C T*M 
for every x € M. If dim Q x = l < 00 we call this a rank l co-distribution. 

Using the definition of vector bundle chart adapted to a subbundle it is not 
hard to show , as indicated in the first paragraph of this section, that a (smooth) 
distribution of rank k < 00 can be described in the following way: 

Claim 11.2 For a smooth distribution A of rank on M we have that for every 
p € M there exists a family of smooth vector fields Xi,...,Xk defined near p 
such that A x = span-fX^a;), ...,Xk(x)} for all x near p. 

Similarly, we have 

Claim 11.3 For a smooth co- distribution Ll of rank k on M we have that for 
every p £ M there exists a family of smooth 1-forms fields u>\,...,LOk defined 
near p such that f l x = span{cai(a;), ..., cufc(a:)} for all x near p. 

On the other hand we can use a co-distribution to define a distribution and 
visa-versa. For example, for a regular co-distribution fi on M we can define a 
distribution A -151 by 

A^ n := {u € T X M : u> x {v) = 0 for all u/ x € fi x }. 

Similarly, if A is a regular distribution on M then we can define a co-distribution 
by 

fl x A := {uj x € TfM : cv x (v) = 0 for all v € A x }. 

Notice that if Ai c A 2 then A^- n c A 5 1 - 0 and (AifiA 2)' Lf2 = A^ + A^- n etc. 
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11.4 Foliations 

Definition 11.8 Let M be a smooth manifold modelled on M and assume that 
M = E x F. A foliation Tm of M (or on M ) is a partition of M into a family 
of disjoint subsets connected {£ Q } aGj 4 such that for every p £ M, there is a 
chart centered at p of the form (p : U ^ V x W cExF with the property that 
for each C a the connected components (U D £ a )p ofUn C a are given by 

lf((U n C a )p) = V X {c ai/3 } 

where c a ^ £ W C F are constants. These charts are called distinguished charts 
for the foliation or foliation charts. The connected sets C a are called the 
leaves of the foliation while for a given chart as above the connected components 
(U CC a )p are called plaques. 

Recall that the connected components (U D C a )p of U H C a are of the form 
C X (U FI C a ) for some x £ C a . An important point is that a fixed leaf C a may 
intersect a given chart domain U in many, even an infinite number of disjoint 
connected pieces no matter how small U is taken to be. In fact, it may be 
that C X (U fl C a ) is dense in U. On the other hand, each C a is connected by 
definition. The usual first example of this behavior is given by the irrationally 
foliated torus. Here we take M = T 2 := S l x S 1 and let the leaves be given as 
the image of the immersions i a '■ t t— >■ (e lat ,e‘*) where a is a real numbers. If a 
is irrational then the image t a (R) is a (connected) dense subset of S 1 x S 1 . On 
the other hand, even in this case there are an infinite number of distinct leaves. 

It may seem that a foliated manifold is just a special manifold but from one 
point of view a foliation is a generalization of a manifold. For instance, we can 
think of a manifold M as foliation where the points are the leaves. This is called 
the discrete foliation on M. At the other extreme a manifold may be thought 
of as a foliation with a single leaf C = M (the trivial foliation). We also have 
handy many examples of 1-dimensional foliations since given any global flow the 
orbits (maximal integral curves) are the leaves of a foliation. We also have the 
following special cases: 

Example 11.1 On a product manifold say M x N we have two complementary 
foliations: 

{{p} X iVjpgM 

and 

{M x {g}} gG Ar 

Example 11.2 Given any submersion f : M — > N the level sets {f~ 1 (q)} q eN 
form the leaves of a foliation. The reader will soon realize that any foliation is 
given locally by submersions. The global picture for a general foliation can be 
very different from what can occur with a single submersion. 


Example 11.3 The fibers of any vector bundle foliate the total space. 
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Example 11.4 (Reeb foliation) Consider the strip in the plane given by {(x, y) : 
M < 1}. For aeRU {±oo} we form leaves C a as follows: 

C a ■■= {(a;, a + f(x)) : \x\ < 1} for a G R 

C ± oo ■= {(± 1 , 2 /) : \v\ < 1 } 

where /( x) := exp — 1. By rotating this symmetric foliation about the y 

axis we obtain a foliation of the solid cylinder. This foliation is such that trans- 
lation of the solid cylinder C in the y direction maps leaves diffeomorphically 
onto leaves and so we may let Z act on C by (x, y, z) i— > (x, y + n, z) and then 
CfL is a solid torus with a foliation called the Reeb foliation. 

Example 11.5 The one point compactification o/R 3 is homeomorphic to S 3 C 
R 4 . Thus S 3 — {p} = R 3 and so there is a copy of the Reeb foliated solid torus 
inside S 3 . The complement of a solid torus in S 3 is another solid torus. R is 
possible to put another Reeb foliation on this complement and thus foliate all of 
S 3 . The only compact leaf is the torus which is the common boundary of the 
two complementary solid tori. 

Exercise 11.2 Show that the set of all v € TM such that v = T<p _1 (v, 0) for 
some v € E and some foliated chart tp is a (smooth) subbundle ofTM which is 
also equal to {v &TM : v is tangent to a leaf}. 

Definition 11.9 The tangent bundle of a foliation T m with structure pseu- 
dogroup Tm,f is the subbundle TTm ofTM defined by 

TTm {« G TM : v is tangent to a leaf} 

= {»£ TM : v = T;p^ 1 (v, 0) for some v £ E and some foliated chart p} 


11.5 The Global Frobenius Theorem 

The first step is to show that the (local) integral submanifolds of an integrable 
regular distribution can be glued together to form maximal integral submani- 
folds. These will form the leaves of a distribution. 

Exercise 11.3 If A is an integrable regular distribution ofTM, then for any 
two local integral submanifolds S± and S 2 of A that both contain a point Xq, 
there is an open neighborhood U of Xq such that 

s x n u = s 2 n u 

Theorem 11.2 If A is a subbundle ofTM (i.e. a regular distribution) then 
the following are equivalent: 

1) A is involutive. 

2) A is integrable. 

3) There is a foliation Tm on M such that TTm = A. 
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Proof. The equivalence of (1) and (2) is the local Frobenius theorem already 
proven. Also, the fact that (3) implies (2) is follows from 11.2. Finally, assume 
that (2) holds so that A is integrable. Recall that each (local) integral submani- 
fold is an immersed submanifold which carries a submanifold topology generated 
by the connected components of the intersections of the integral submanifolds 
with chart domains. Any integral submanifold S has a smooth structure given 
by restricting charts U, ip on M to connected components of S ft U (not on all 
of 5 fl 17!). Recall that a local integral submanifold is a regular submanifold 
(we are not talking about maximal immersed integral submanifolds!). Thus we 
may take U small enough that S fl U is connected. Now if we take two (local) 
integral submanifolds Si and S 2 of A and any point Xg £ Si fl S 2 (assuming 
this is nonempty) then a small enough chart U, ip with xg G U induces a chart 
U D Si, fpluns-L on '-’1 an< ^ a c h ar t C Xo {U n S 2 ), V’lcy (uns 2 ) on ^2- But as we 
know Si fl U = S2 fl U and the overlap is smooth. Thus the union Si U S 2 
is a smooth manifold with charts given by U fl (Si U S 2 ), 4’\un(S 1 uS 2 ) anc ^ the 
overlap maps are U fl (Si fl S 2 ), ' i /’l[/n(SinS 2 )' We ma y ex t en( l to a maximal 
connected integral submanifold using Zorn’s lemma be we can see the existence 
more directly. Let C a {x 0 ) be the set of all points that can be connected to Xg 
by a smooth path c : [0, 1] — > M with the property that for any tg G [0, 1], the 
image c(<) is contained inside a (local) integral submanifold for all t sufficiently 
near tg. Using what we have seen about gluing intersecting integral submani- 
fold together and the local uniqueness of such integral submanifolds we see that 
C a (x 0 ) is a smooth connected immersed integral submanifold that must be the 
maximal connected integral submanifold containing xg . Now since xg was arbi- 
trary there is a maximal connected integral submanifold containing any point 
of M. By construction we have that the foliation C given by the union of all 
these leaves satisfies (3). ■ 

There is an analogy between the notion of a foliation on a manifold and a 
differentiable structure on a manifold. In order to see this more clearly it will 
be useful to introduce the notion of a pseudogroup. The example to keep in 
mind while reading the following definition is the family of all locally defined 
diffeomorphisms d :: M — > M. 

With a little thought, it should be fairly clear that a foliation is the same 
thing as a manifold with a Fm,f structure and the atlas described in the definition 
just given is a foliated atlas. 

From this point of view we think of a foliation as being given by a maximal 
foliation atlas which is defined to be a cover of M by foliated charts. The 
compatibility condition on such charts is that when the domains of two foliation 
charts, say ipi : U\ — > V\ x W\ and <^2 : U 2 — ► U 2 x W 2 , then the overlap map 
has the form 


V2 0 < 4 >\ 1 ( x ) y) = (/(x,y),ff(y)). 

A plaque in a chart tp\ : U\ — > V\ x W\ is a connected component of a set of the 
form <pj" 1 {(x, y) : y =constant}. 
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11.6 Singular Distributions 

Lemma 11.3 Let Xi , ..., X n be vector fields defined in a neighborhood of x £ M 
such that Xi(x), ...,X n (x) are a basis for T X M and such that [Xi ,Xj] = 0 in a 
neighborhood of x. Then there is an open chart U, ip = (y 1 , containing x 

such that Xi\jj = ffA . 

Proof. For a sufficiently small ball 5(0, e) C K™ and t = (ti, ..., t n ) £ 5(0, e) 
we define 

f(ti,...,t n ) — FI* 1 o ■ ■ ■ o Fl? n n (x). 

By theorem 7.10 the order that we compose the flows does not change the value 
of f(tu^.,t n ). Thus 

d ,, . 

— /(fi,...,t„) 
oti 

= W i Fl^ 0 ---°Flt n ( x ) 

= —Flff o FI* 1 o • • • o Flf"(x) (put the i-th flow first) 

Oti 

X i (Fl£o...oFl? 7 f(x)). 

Evaluating at t = 0 shows that T 0 f is nonsingular and so (ti, ..., t n ) i— > f(t\, ..., t n ) 
is a diffeomorphism on some small open set containing 0. The inverse of this 
map is the coordinate chart we are looking for (check this!). ■ 

Definition 11.10 Let Xi oc (M) denote the set of all sections of the presheaf 
Xm- That, is 

Xi oc (M) := |J X M {U). 

open UCM 

Also, for a distribution A let Xa(M) denote the subset of Xi oc (M) consisting 
of local fields X with the property that X(x) £ A x for every x in the domain of 
X. 

Definition 11.11 We say that a subset of local vector fields X C X&(M) spans 
a distribution A if for each x £ M the subspace A x is spanned by {X(x) : X £ 
X}. 

If A is a smooth distribution (and this is all we shall consider) then £a (M) 
spans A. On the other hand as long as we make the convention that the empty 
set spans the set {0} for what every vector space we are considering then any 
X C £a (M) spans some smooth distribution which we denote by A(X). 

Definition 11.12 An immersed integral submanifold of a distribution A is an 
injective immersion l : S — > M such that T s l(T s S) = A t ( s ) for all s £ S. 
An immersed integral submanifold is called maximal its image is not properly 
contained in the image of any other immersed integral submanifold. 
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Since an immersed integral submanifold is an injective map we can think 
of S' as a subset of M. In fact, it will also turn out that an immersed integral 
submanifold is automatically smoothly universal so that the image t(Sj is an 
initial submanifold. Thus in the end, we may as well assume that S C M 
and that t : S — ■> M is the inclusion map. Let us now specialize to the finite 
dimensional case. Note however that we do not assume that the rank of the 
distribution is constant. 

Now we proceed with our analysis. If i : S — > M is an immersed integral 
submanifold and of a distribution A then if X £ £a (M) we can make sense of 
l*X as a local vector field on S. To see this let U be the domain of X and take 
s £ S with l(s) £ U. Now X(i(s)) £ T s l(T s S) we can define 

t*X(s) := (T st )~ 1 X( i (s)). 

i*X(s) is defined on some open set in S and is easily seen to be smooth by 
considering the local properties of immersions. Also, by construction i* X is l 
related to X. 

Next we consider what happens if we have two immersed integral submani- 
folds Li : Si — > M and t 2 : S 2 — ► M such that ti(Si)ni. 2 (jS 2 ) 7^ 0- By proposition 
7.1 we have 

tj o Flj ! X = Fp o a for i = 1,2. 

Now if xq £ (■i(S'i)nt 2 (S , 2 ) then we choose Si and s 2 such that ti(si) = i 2 (s 2 ) = 
Xo and pick local vector fields X \, ..., Xk such that (Xi(xo), •••, Xk(x 0)) is a basis 
for A Xo . For i = 1 and 2 we define 

:= (F^^o-.-oFl^) 

and since | 0 f t = /,* X 3 for i = 1,2 and j = 1, ..., k we conclude that /j, * = 1,2 
are diffeomorphisms when suitable restricted to a neighborhood of 0 £ M fc . Now 
we compute: 

(r" 1 o tl o /i)^ 1 , ... ,t k ) = (A 1 0 A 0 Fi;r Yl o • • • o Fi;P fe )0n) 

= (* 2 - 1 F1^ 1 o...oF1^o* 1 )(xi) 

= (Fi;p i o...oFi;p fe o*- i o* 1 )( a:i ) 

-Ad/ 1 I k ). 

Now we can see that (p is a diffeomorphism. This allows us to glue together 
the all the integral manifolds which pass through a fixed x in M to obtain a 
unique maximal integral submanifold through x. We have prove the following 
result: 

Proposition 11.1 For a smooth distribution A on M and any x £ M there is 
a unique maximal integral manifold L x containing x called the leaf through x. 
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Definition 11.13 Let X C X; oc (M) . We call X a stable family of local vector 
fields if for any X, Y G X we have 

(Flf )*Y G A’ 

whenever (F1*)*T is defined. Given an arbitrary subset of local fields X C 
Xi oc (M) let S(X) denote the set of all local fields of the form 

(FI* 1 o Fl f I 2 o • • • o Fl* fc )*F 

where Xi , Y G X and where t = (t 1 , ..., t k ) varies over all k-tuples such that the 
above expression is defined. 

Exercise 11.4 Show that 5(A”) is the smallest stable family of local vector fields 
containing X . 

Definition 11.14 If a diffeomorphism 4> of a manifold M with a distribution A 
is such that T x (f>{ A x ) C ^m x ) f or a U x G M then we call <j> an automorphism 
of A. If <f> : U — > <p(U) is such that T x <j>( A x ) C A^( x ) for all x G U we call a 

local automorphism of A . 

Definition 11.15 If X £ X; oc (M) is such that T X F1*(A X ) C A F1 X( X ) we call 
X a (local) infinitesimal automorphism of A. The set of all such is denoted 
auti oc (A). 

Example 11.6 Convince yourself that aut/ oc (A) is stable. 

For the next theorem recall the definition of %&■ 

Theorem 11.3 Let A be a smooth singular distribution on M. Then the fol- 
lowing are equivalent: 

1) A is integrable. 

2) £a is stable. 

3) autzoc(A) ft spans A. 

4) There exists a family X C X; oc (M) such that S(X) spans A. 

Proof. Assume (1) and let X G Xa- If C x is the leaf through x G M then 
by proposition 7.1 

FI* o t = l o Fl'l** 
where t : C x M is inclusion. Thus 

T X (F1* )(A X ) = T(F1* ) • T x l ■ ( T X C X ) 

= T(t o Fh**) • (T X C X ) 

= TlT x { FF**) • (T X C X ) 

= TlT FV *X( x )C x = A pv *X( x y 

Now if Y is in Xa then at an arbitrary x we have Y (a;) G A x and so the above 
shows that ((Fl*)*F)(a;) G A so (F1*)*F) is in Xa • We conclude that Xa is 
stable and have shown that (1) => (2). 
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Next, if (2) hold then Xa C aut; oc (A) and so we have (3). 

If (3) holds then we let X := aut; oc (A) 01 a- Then for Y. Y £ X we have 
(Fi- Y )*y g anc i so X C S(X) C Xa- from this we see that since X and Xa 
both span A so does S(A’). 

Finally, we show that (4) implies (1). Let x £ M. Since S(X) spans the 
distribution and is also stable by construction we have 

F(Fl t )A X = A F1 x( x ) 

for all fields X from S(X). Let the dimension A x be k and choose fields 
X\ ,...,Xk £ S(X) such that Xi(x), ...,Xk(x) is a basis for A x . Define a map 
/ :: R k — > M by 

f(t\ ..., t n ) := (FI* 1 FI* 2 o • • • o Fl**)(z) 

which is defined (and smooth) near 0 £ R k . As in lemma 11.3 we know that 
the rank of / at 0 is k and the image of a small enough open neighborhood of 0 
is a submanifold. In fact, this image, say S = f(U ) is an integral submanifold 
of A through x. To see this just notice that the T X S is spanned by ^j(O) for 
j = 1, 2, k and 

^ (0) = ^| o (F1 ? F1 ?°-"° F1 ? )(a:) 

= ^(Fl^Fl* 2 o • • • o Flf/L-^XjiiFlf/Flf/^ o • • • o Fl^XaO) 

= ((Fi^nn^r ° ° (Fi^-_ i 1 )*^)(/(t 1 . -,*"))• 

But S(X) is stable so each J£(0) lies in A f( t y From the construction of / and 
remembering ?? we see that span{^(0)} = Tf^S = Ajp) and we are done. 



Chapter 12 

Connections on Vector 
Bundles 

12.1 Definitions 


A connection can either be defined as a map V : X(M) x T(M, E) — > T(M, E ) 
from which one gets a well defined map V : TM x F ( M, E) —> T(M,E) or 
the other way around. The connection should also be natural with respect to 
restrictions to open sets and so a sheaf theoretic definition could be given. 

Definition 12.1 A connection on a C°°— vector bundle E — > M is a map V : 
X(M) x r(M, E) — > r(M, E) (where V(X, s) is written as V xs) satisfying the 
following four properties for all f G C°° , X,Xi,X 2 £ 3£(M), s,si,s 2 G T(M,E) 
and 

1. Vfx(s) = fV x s for all f G C°°, X G 3t(M) and s G T(M,E) 

2. V a -i+a- 2 s = VajS + Va' 2 s for all X\ , X 2 G X(M) and s G T(M,E) 

3- Va'(si + s 2 ) = VxSi + Va ^2 for all X G X(M) and si, S 2 € r (M, E) 

4. V A -(/s) = (Xf)s + /Va(s) for all / G C°° , X G X(M) and s G T(M, E) 

As we will see below, for finite dimensional E and M this definition is enough 
to imply that V induces maps X u : X(U) x T(U,E) — > T(U,E) which are 
naturally related in the sense we make precise below and furthermore the value 
(V xs)(p) depends only on the value X p and on the values of s along any smooth 
curve c representing X p . The proof of these facts depends on the existence of 
smooth bump functions and so forth. We have already developed the tools 
to obtain the proof easily in sections 2.8 and 9.4 and so we leave the trivial 
verification of this to the reader. 
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In the infinite dimensional case we are not guaranteed such thing and so we 
may as well include the extra properties into the definition: 

Definition 12.2 ((better)) A natural covariant derivative (or connec- 
tion 1 ) V on a smooth vector bundle E — » M is an assignment to each open set 
U C M of a map X u : X(U) x T(U,E) -► X{U) written X u : (V, s) -► S7 u x s 
such that the following hold: 

1. VyS is C°° (U)-linear in X, 

2. V y s is R-linear in Y, 

3. V y(/s) = /V yS + (Xf)s for all X,Y G X(U), s G T(U,E) and all 
f€C°°(U). 

4- If V C U then ry(Vys) = ^^u x r v s (naturality with respect to restric- 
tions). 

5. (Vys)(p) only depends of the value of X at p (infinitesimal locality). 

X x s is called the covariant derivative of s with respect to X. We will 
denote all of the maps X u by the single symbol V when there is no chance of 
confusion. 

In the same way that extends a derivation to a tensor derivation one may 
show that a covariant derivative on a vector bundle induces naturally related 
connections on all the multilinear bundles. In particular, 7r* : E* — > M denotes 
the dual bundle to E —> M we may define connections on 7r* : E* — > M and 
on 7r <8 7r* : E 8 E* — > M. We do this in such a way that for e G r(M, E) and 
e* G F(M,E*) we have 

Vf 0ir (e®e*) = V x e«e*+e0 Vf* e* 

and 


(Vf'e*)(e) = X(e*(e))-e*(V x e). 

Of course this last formula follows from our insistence that covariant differenti- 
ation commutes with contraction: 

X(e*(e)) = 

(V x C(e ® e*)) = C'tVf 05 * (e <g> e*)) 

= C (v x e ®e* + e® Vf ' e*) 

= e*(V x e) + (vfe*)(e) 

1 lt would be better if we could avoid the term connection at this point and use covariant 
derivative instead since later we will encounter another definition of a connection which refers 
to a certain “horizontal” subbundle of the tangent bundle of the total space of a vector 
bundle (or also of a principal bundle) . Connections in this sense determine a natural covariant 
derivative. Conversely, a natural covariant derivative determines a horizontal subbundle, i.e. 
a connection in this second (more proper?) sense. Thus a natural covariant derivative and a 
connection are mutually determining. 
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where C denotes the contraction / ® f* i— > /*(/). All this works like the tensor 
derivation extension procedure which we have already done. 

Now the bundle E®E* — > M is naturally isomorphic to End (A) and by this 
isomorphism we get a connection on End(E). 

(V x A)(e) = Vx(A(e)) - A(V x e) 

Exercise 12.1 Prove this last formula. 

Solution: Since c : e (g) A i— > A(e) is a contraction we must have 

Vx(A(e)) = c (V x e <8> A + e ® VaA) 

= A(y x e) + (S7 x A)(e) 

12.2 Local Frame Fields and Connection Forms 

Let 7r : E — > M be a rank k vector bundle with a connection V. Take M 
top be of finite dimension n. Recall that a choice of a local frame field over 
an open set U C M is equivalent to a trivialization of the restriction Ejj. We 
now examine expression for the connection from the view point of such a local 
frame field e = (ei,...,efc). Recall that we have a vector bundle chart (a local 
trivialization) on an open set U exactly when there exists a frame field. It is 
not hard to see that there must be a matrix of 1-forms A = (A„) i< 0l b<fc such 
that for X £ T(U) we may write 

Vxe. = A b a (X)e b . 

Here and in what follows we use the Einstein summation convention. Also 
the dependence on the point of evaluation is suppressed since something like 
p i— > A b a | (X p )e b (p) is rather awkward looking. The matrix of 1-forms A may 
be thought of as a matrix valued 1-form A so that for a fixed vector field defined 
on U we have that p i— > A p (X p ) is a matrix valued function on U. Now let us 
assume that U is simultaneously the domain of a chart (x 1 , ... ,x n ) on M. Then 
we may write X = X l di and then 

V 9i e a = A h a {di)e b = A b a e b 


V A -s = Va (s a e a ) 

= VA(s°e a ) 

= (Xs a )e a + s a X7 x e a 
= (Xs a )e a + s a A b a (X)e b 
= (Xs a )e a + s r A a r {X)e a 
= (Xs a + A a r {X)s r )e a 


and so 
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So the a— component of V.y s is (Vxs) a = Va -_s a := Xs a + A^(X)s r . Of 
course the frame are defined only locally say on some open set U . The restriction 
Ejj is trivial. Let us examine the forms A b a on this open set. The change of 
frame 

rb b 

f = 9 a e b 

which in matrix notation is 

/ = eg. 

Differentiating both sides 

f = eg 
V/ = V {eg) 
fA' = (Ve)g + edg 
fA! = egg~ 1 Ag + egg~ 1 dg 
fA' = fg~ 1 Ag + fg~ 1 dg 
A 1 = g~ 1 Ag + g~ 1 dg 

Conversely, we have the following theorem: 

Theorem 12.1 Let it : E — > M be a smooth F —vector bundle of rank k. 
Suppose we are given a cover {U a } of the base space M by the domains of 
frame fields e a = (e“,...,e£) and an association of a matrix valued l— form 

a. 

to A : U a — ► gl(k, F) (g) T*U a to each. Then there is a unique connection on 
7r : E — > M which is given in each U a by 

Xxs = {Xs a + A a r {X)s r )e a 

for s = X> a e a . 

Sometimes one hears that A is locally an element of Hom(£, E ) but the 
transformation law just discovered says otherwise. The meaning of the state- 
ment can only be the following: If E were trivial then we could choose a distin- 
guished global frame held e = (ei, ..., e n ) and define a connection by the simple 
rule V°y(sV 0 ) = ( Xs a )e a ■ Now an the above calculation of the transformation 
law shows that the difference of two connections on a vector bundle is in fact an 
element of Hom^,!?). Equivalently, if A A = A— A is the difference between 
the connection forms for two different connections then under a change of frame 
we have 

(AA)' = A' -A! 

= g X Ag + g _1 dg - (g^ x Ag + g~ l dg) 

= g X Ag - g~ l Ag 

= 5 _1 (AA) 5 

so that A A defines a section of the bundle End(A). 
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Exercise 12.2 Show that the set of all connections on E is naturally an affine 
space C(E) whose vector space of differences is End(E). For any fixed con- 
nection V° we have an affine isomorphism End(E) — > C(E) given by A A i— > 
V° + AA. 

Now in the special case mentioned above for a trivial bundle the connection 
form in the defining frame is zero and so in that case AA = A. So in this 
case A determines a section of Hom(E, E). Now any bundle is locally trivial 
so in this sense A is locally in End(E). But this is just confusing and in fact 
cheating since we have changed (by force so to speak) the transformation law 
for A among frames defined on the same open set to that of A^4 rather than A. 
The point is that even though A A and A are equal in the distinguished frame 
they are not the same after transformation to a new frame. It seems to the 
author best to treat A for what it is; a matrix valued 1— form which depends 
on the frame chosen. 


12.3 Parallel Transport 

Suppose we are given a curve c : I — > M together with an E— valued section 
along c; that is a map a : I E such that the following diagram commutes: 

E 

a 

/ I 

I A M 

We wish to define X t cr = Xp t a. Lets get some motivation. If c is an integral 
curve of a field X then we have 

(' V x s)(c(t )) = (X(c(t)).s a (c(t)) + (A?\ c(t) X c(t) )s r (c(t)))e a (c(t)) 

= (c(t).s a (t))e a (t) + (A“ | c(t) c(t))s r (t)e a (t ) 

where we have abbreviated s a (t ) := s a (c(t )) and e a (t) := e a (c(t)). This shows 
that the value of Vxs at c(t) depends only on c(t) and (soc)(t). This observation 
motivates the following definition: 

Definition 12.3 Let c : I — > M be a smooth curve and a an E values section 
along c. We define another section along c denoted X g t a by the requirement 
that with respect to any frame field (e a ) we have 

V 3t a := + (^lc(t) c(t))a r (t)e a (t) 

Since c might not be even be an immersion the definition only makes sense 
because of the fact that it is independent of the frame. To do the calculation 
which shows this frame independence it will pay to make the following abbrevi- 
ations 
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Figure 12.1: Parallel Transport. 


1. a' = (a[, ..., a' k ) (the components of a with respect to a new basis / = eg) 

2. &o*(t)=do' 

3. A=(A? | c(t) c(t)) 

Then using matrix notion we have 

fda ’ + fA'a' 

= egd(g~ 1 c r) + eg{g~ 1 Ag + g~ 1 dg)g~ 1 a 
= eg{g~ 1 dcr - g~ 1 dgg~ 1 a) + eAa + egdgg _1 cr 
= eda + eAg. 

Exercise 12.3 Flesh out this calculation without the abbreviation. 

Now on to the parallel transport. 

Definition 12.4 Let c : [a, 6] — > M be a smooth curve. A section a along c is 
said to be parallel along c if 

(V dt cr)(t) = 0 for all t € [a, b]. 

Similarly, a section a £ T(M,E) is said to be parallel if V xo = 0 for all 

x e X(M). 

Exercise 12.4 Show that a £ T(M, E) is a parallel section iff X o c is parallel 
along c for every curve c : I — > M 



12.3. PARALLEL TRANSPORT 


195 


Exercise 12.5 Show that for f : I — > K and a : I — > M is a section of E along 
c then Va t (/<7 ) = (§ct + /V^er. 

Exercise 12.6 Continuing the last exercise show that if a : I — > U C M where 
U is the domain of a local frame field {ei, .. . , e* } then aft) = y? =1 <7®(i)ei(c(i)). 

Theorem 12.2 Given a smooth curve c : [a, 6] — » M and numbers to £ [a, 5] 
with c(t o) = p and vector v £ E p there is a unique parallel section a c along c 
such that a c (to) = v. 

Proof. In local coordinates this reduces to a first order initial value problem 
which may be shown to have a unique smooth solution. Thus if the image of 
the curve lies completely inside a coordinate chart then we have the result. The 
general result follows from patching these together. This is exactly what we do 
below when we generalize to piecewise smooth curves so we will leave this last 
part of the proof to the skeptical reader. ■ 


Under the conditions of this last theorem the value a c [t) is vector in the fiber 
E c ( t ) and is called the parallel transport of v along c from c(<o) to c(t). Let us 
denote this by P(c)\ Q v. Next we suppose that c : [a, b] —> M is a (continuous) 
piecewise smooth curve. Thus we may find a monotonic sequence to, fi, ...tj = t 
such that Ci := ch t ._ i t , (or ch t . ti _A) is smooth. 2 In this case we define 

P{c)\ 0 :=P(c) t tj _ 1 ---oP(c) t t l 

Now given v £ E c (t 0 ) as before, the obvious sequence of initial value problems 
gives a unique piecewise smooth section a c along c such that a c (to) = v and the 
solution must clearly be P(c)\ Q v (why?). 

Exercise 12.7 P(c)* 0 : -E c p 0 ) — > E c (t) a linear isomorphism for all t with 
inverse P{c)\° and the map t i— > a c (t) = P(c)\ o v is a section along c which is 
smooth wherever c is smooth. 


We may approach the covariant derivative from the direction of parallel 
transport. Indeed some authors given an axiomatic definition of parallel trans- 
port, prove its existence and then use it to define covariant derivative. For us 
it will suffice to have the following theorem: 


Theorem 12.3 For any smooth section a of E defined along a smooth curve 
c : I — > M . Then we have 


(Va t cr)(f) = lim 
£ — ^0 


P(c)\ + ,a(t 


e) 


a{t) 


2 It may be that t < to . 


e 
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Proof. Let e\, e k be a basis of E c ( t(j) for some fixed to € I. Let e.j (t) := 
P{c)\. Then Vg t ej(<) = 0 and a(t) = Yh c*(t)ej(f). Then 


lim P(c)t+e<r(t + e) - o-(^) 

e *o e 

= Hm + e)P{c)\ +e ej{t + e) - a(t) 

e — *0 e 


= lim 

e — *0 


cr*(t + e)ei(t) - a l (t)ei(t) 




On the other hand 

(Va t cr)W = Va t (cr J (t)ei(<)) 

fjfji 

E cfcr* 

dA' )e ‘ (,) ' 


1— form 9 = Y e j@ l which takes any vector to itself: 

6 ( Np) = ^Z e j{p) d \ v p) 

= ^2 vte j(p) = v p 

Let us write d v 9 = \Y e k® T^9 l A = \ Y e k 0 T k . If V is the Levi Civita 
connection on M then consider the projection P A : E <g>TM ®T*M given by 
P A T(£,v) = T(£,v) — T(v,£). We have 

_ u 

vej = cjj e k = ecu 

V8 j : -J k e k 


V{(ej ® 6> J ) 

P A (X/c9J){v) = -ui{0e k {v)+uV k {v)8 k { 0 = -J k A 8 k 
Let T(£,u) = V € (ei®^')(v) 

= (V 5 ej) 0 9 J (v) + e* ® (V^)( W ) = uj k {£)e k 0 6> J (i>) + e» 0 (-u 3 k (£,)9 k (v)) 
= uf(£)e k 0 (^O) + e* 0 (-aj k ({)9 j (v)) = e k 0 (wf (0 - w$(£))9 j (v) 

Then 
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(P A T)(£,v)=T(t;,v)-T(v,Z) 

= (Vej) A + ej 0 dQi 
= d v (ej 0 


d v 6> = d v Y^ e o 0J 

= E(Ve ,) ( >:«W' ( 12 . 1 ) 

= E(E e fe 0 a;*) A 6h + E e& <8> dO k 

k 

= E e fc®(E^ A ^+ d0fe ) 

& j 

So that JN A Qi + dO k = \ r k . Now let cr = ^ P e j be a vector field 

d v d v a = d v (d V E e t^ J ) = rfV (E( Ve ^)^ + E e i 0 d - f J ) 

(E(ve,)^ + E' : ' /r v<,)/ ; + E V( .'//■' + E ej 0 ddfi ^ 

E/'(^v<,) >> 

So we seem to have a map / J ej i— > Q k fie k . 

e r f2j = d v Vej = d v {e k u; k ) 

= Vefe A Wj + e k du> k 
= e r u r k A Wj + e k dui k 
= e r u^ A Wj ' + e r du>'j 
= e r (dujj + iv k A Wj ) 


d v Ve = d v (eu>) = Ve A w + eciw 


From this we get 0 = d(A 1 A)A 1 = (cL4 J )^L4 1 + A 1 c?Agl 1 
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dA~ l = A~ 1 dAA ~ 1 

Q r j = du] + u r k A w* 

SI = dio + lo A u> 

O — duj cu A u) 

Q' = d (A-'uA + A~ l dA) + (A-'wA + A~ 1 dA) A (A~ 1 toA + A~ 1 dA) 

— d ( A ^ioAdj -\~ d (A i dAdj T A A coA T A A ciA 
+ A~ 1 dAA~ 1 Awi+ A~ x dA A A~ x dA 
= d (A 4- dA ' A dA T A A u^A -I- A A dA 

+ A~ 1 dAA~ i ujA + A~ l dA A A _1 dA 

= dA~ l u>A + A _1 dwA — A~ 1 udA + dA -1 A dA + A _1 w A w4 + A _1 w A dA 
+ A~ 1 dAA ~ 1 Awi+ A~ l dA A A _1 dA 
= A ^dc^A -|- A ^cu A ujA 
ST = A~ l OA 

oj' = A~ 1 ojA + A~ 1 dA 

These are interesting equations let us approach things from a more familiar 
setting so as to interpret what we have. 


12.4 Curvature 

An important fact about covariant derivatives is that they don’t need to com- 
mute. If a : M — > E is a section and X £ X(M) then Vx<r is a section 
also and so we may take it’s covariant derivative VyVjd with respect to some 
Y £ In general, VyVxcr ^ VxVya and this fact has an underly- 

ing geometric interpretation which we will explore later. A measure of this 
lack of commutativity is the curvature operator which is defined for a pair 
X,Y £ £(M) to be the map F(X, Y) : T(E) — > T(E) defined by 

F(X, 4 )(T := VxVyC — VyVjfCr — V [X,Y] &■ 
or 

[Vx,Vy](T - V [X,Y\® 

Theorem 12.4 For fixed a the map (X, Y) i— > F(X,Y)a is C°°(M) bilinear 
and antisymmetric. 

F(X,Y) : T(E) — > T(E) is a C°°(M) modide homomorphism; that is it is linear 
over the smooth functions: 


F(X,Y)(fa) = fF(X,Y)(a) 
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Proof. We leave the proof of the first part as an exercise. For the second 
part we just calculate: 

F(X,Y)(fa) = VxVy/a - VyVx/a - V [x , Y] f<J 

= Vx(/Vya + (Yf)a) - Vy(/V x a + (Xf)a) 

-fV [x>Y] a-({X,Y}f)o 

= /VxVya + (Xf)Vya + ( Yf)Vx<J + X(Y f) 

- fVyVxa - (Yf)Xxa - (Xf)Xya - Y(Xf) 

- fV[x,y]<r ~ {[X,Y]f)v 

= /[V A-,Vy] - fV [x , Y] a = fF{X,Y)a 


Exercise 12.8 Prove the first part of theorem 12. f. Now recall that 

End c .oo(r(£)) 

£* r(M, End(E)) 

= r(M, E (g) E*) 

Thus we also have f as a map E : X(M) x X(M) — > r(M, End(E)). But 
then since E is tensorial in the first two slot and antisymmetric we also may 
think in the following terms 

E € T(M, Horn (E, E) <g) A 2 M) 
or 

E G T(M,E®E* (g) A 2 M). 

In the current circumstance it is harmless to identify E ® E* ® A 2 Af with 
A 2 Af (g) E (g) E* the second one seems natural too although when translating 
into matrix notation the first is more consistent. In any case we have a natural 
structure of an algebra on each fiber given by 

{A <g) a) A (B <g) (3) := (A o B) (g) a A (3 

and this gives a C°°(Af)— algebra structure on T(Af, Hom(E, E) (g) A 2 Af). 

We will describe the relationship between the curvature E and parallel trans- 
port but first lets see another approach to curvature. For a vector bundle 
E — > M we may construct the 

Let e = (ei, ..., e^) be a frame defined on an open set U and for the restriction 
of a section to U we write 

k 

cr = J2 a u e i 

i = 1 
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for smooth functions : U — + F (which is R or C). Then locally, 

k 

F(X,Y)a = F(X,Y)-J2<ei 

i= 1 
k 

= ^4F(X,y)e, 

i= 1 
k 

J2<V(X,Y) 

i = 1 



Chapter 13 

Riemannian and 
semi-Riemannian Manifolds 


The most beautiful thing we can experience is the mysterious. It is the source of all 

true art and science. 

-Albert Einstein 


13.1 The Linear Theory 

13.1.1 Scalar Products 

Definition 13.1 A scalar product on a (real) finite dimensional vector space V 
is a nondegenerate symmetric bilinear form g : V x V — > ffi. . The scalar product 
is called 

1. positive (resp. negative) definite if g(v,v) > 0 (resp. g(v,v) < 0) for all 
v £ V and g(v, v) = 0 =>• v = 0. 

2. positive (resp. negative) semidefinite if g(v,v) > 0 (resp. g(v,v) < 0) for 
all v £ V. 

Nondegenerate means that the map g : V — > V* given by tm g(v , .) is a 
linear isomorphism or equivalently, if g(v, w) = 0 for all w £ V implies that 
v = 0. 

Definition 13.2 A scalar product space is a pair V,g where V is a vector 
space and g is a scalar product. 

Remark 13.1 We shall reserve the terms inner product and inner product 
space to the case where g is positive definite. 

Definition 13.3 The index of a symmetric bilinear g form on V is the largest 
subspace W C V such that the restriction g\ w is negative definite. 
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Given a basis B = {v\ , v n ) for V we may form the matrix [g] which has 
as ij- th entry g (i>i,Vj). This is the matrix that represents g with respect to the 
basis B. So if v = B [v] B , w = B [w] e then 

g('iVic) = [vf[g] BB M e . 

It is easy to see that the index ind(g) is zero iff g positive semidefinite. It is a 
standard fact from linear algebra that if g is a scalar product then there exists 
a basis e±, e n for V such that the matrix representative of g with respect to 
this basis is a diagonal matrix diag(— 1, 1) with ones or minus ones along the 
diagonal and we may arrange for the minus ones come first. Such a basis is called 
an orthonormal basis for V, g. The number of minus ones appearing is the index 
ind(g) and so is independent of the orthonormal basis chosen. Thus if ei, ..., e n is 
an orthonormal basis for V, g then g (e*, e^) = Cid t j where e, = g(e*, ef) = ±lare 
the entries of the diagonal matrix the first ind(g) of which are equal to —1 and 
the remaining are equal to 1. Let us refer to the list of ±1 given by (ei, ...., e n ) 
as the signature. 

Remark 13.2 The convention of putting the minus signs first is not universal 
and in fact we reserve the right to change the convention to a positive first 
convention but ample warning will be given. The negative signs first convention 
is popular in relativity theory but the reverse is usual in quantum field theory. 
It makes no physical difference in the final analysis as long as one is consistent 
but it can be confusing when comparing references from the literature. 

Another difference between the theory of positive definite scalar products 
and indefinite scalar products is the appearance of the e* from the signature in 
formulas which would be familiar in positive definite case. For example we have 
the following: 

Proposition 13.1 Let ei, ...,e n be an orthonormal basis for V,g. For any v £ 
V,g we have a unique expansion given by v = JT effv, ef)ei. 

Proof. The usual proof works. One just has to notice the appearance of 
the e.j. ■ 


Definition 13.4 Let V,g be a scalar product space. We say that v and w are 
mutually orthogonal iff g(v,w) = 0. Furthermore, given two subspaces Wi and 
W 2 of V we say that Wi is orthogonal to W 2 and write Wi J_ W 2 iff every 
element of Wi is orthogonal to every element 0 /W 2 . 

Since in general g is not necessarily positive definite or negative finite it may 
be that there are elements that are orthogonal to themselves. 

Definition 13.5 Given a subspace W of a scaler product space V we may con- 
sider the orthogonal subspace W 1- = {u £ V : g(v,w) = 0 for all w £ W}. 
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We always have dim(W) + dim(W J -) = dim(V) but unless g is definite we 
may have W fl W 1 - 7^ 0. Of course by nondegeneracy we will always have 
U 1 - = 0. 

Definition 13.6 A subspace W of a scaler product space V, g is called nonde- 
generate if g| w is nondegenerate. 

Lemma 13.1 A subspace W C V, g is nondegenerate iff~V = W © (inner 
direct sum). 

Proof. Easy exercise in linear algebra. ■ 

Just as for inner product spaces we define a linear isomorphism R : Vi, g! — * 
V2, g2 from one scalar product space to another to be an isometry if gi(i>, w) = 
g2 (Rv,Rw). It is not hard to show that if such an isometry exists then gi and 
g2 have the same index and signature. 

13.1.2 Natural Extensions and the Star Operator 

If we have a scalar product g on a finite dimensional vector space V then there 
is a natural way to induce a scalar product on the various tensor spaces TJ(V) 
and on the Grassmann algebra. The best way to explain is by way of some 
examples. 

First consider V*. Since g is nondegenerate there is a linear isomorphism 
map gt : V — > V* defined by 


gb(*0M = &(v,w). 

Denote the inverse by g® : V* — > V. We force this to be an isometry by defining 
the scalar product on V* to be 

g*(a,/3) = g(g # (a), g a (/?))• 

Under this prescription, the dual basis e 1 , ..., e" to an orthonormal basis e ±, ..., e n 
for V will be orthonormal. The signature (and hence the index) of g* and g are 
the same. 

Next consider T) (V) = V © V*. We define the scalar product of two simple 
tensors v\ © ay, V2 © 02 G V © V* by 


gi(t>i ® <*1,^2 © 012) = g(v 1 ,v 2 )B*(a 1 ,a 2 ). 


One can then see that for orthonormal dual bases e 1 , ..., e n and e \, ..., e n we have 
that 

{o © <i,j<n 

is an orthonormal basis for T) (V), g| . In general one defines g( so that the 
natural basis for TJ(V) formed from orthonormal e 1 , ..., e" and e \, ..., e n will be 
orthonormal. 
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Notation 13.1 In order to reduce notational clutter let us agree to denote all 
the scalar products coming from g simply by (., .). 

Exercise 13.1 Show that under the natural identification o/V®V* with L(V, V) 
the scalar product of a linear transformation A with it self is the trace of A. 

Next we see how to extend the maps gt and g ^ to maps on tensors. We 
give two ways of defining the extensions. In either case, what we want to define 
is maps (g b )j : T r fl (V) T r ~ 1 s+1 (V) and (g#)j : T^(V) -> T r+1 ^(V) 
where 0 < i < r and 0 < j < s. Our definitions will be given on simple tensors 
by 


(gb)l('U'l <8 • • • 8 w r ® w 1 ® • • • ® u> s ) 

= w\ ® • • • 8 wl ® • • • w r 8 gt, (wf) 8 w 1 ® ■ ■ ■ <8 u s 


and 


(g 11 )] (toi ® • • • 8 w r 8 co 1 8 • • • 8 w s ) 

= wi ® • • • ® w r ® g^w 7 ’) ® u 1 ® • • • ® ob ® • • • ® u s . 

This definition is extended to all of TJ (V) by linearity. For our second, equiva- 
lent definition let T £ T r S (V). Then 

((gb)tY)(a\...,a r_ Vi,---,^+i) 

:= T(a\ ..., a r_1 ,gb(u 1 ); v 2 , -,v s+1 ) 


and similarly 


:= Y(a\ ..., a r ,g # (a r+1 ); v 2 , u s +i) 


Lets us see what this looks like by viewing the components. Let fi,—,f n be an 
arbitrary basis of V and let f 1 , ..., f n be the dual basis for V*. Let g,j := g (/,;, ff) 
and g y = g *(/*,/ J ). The reader should check that J2kSkjg lk = Now let 
t € TJ(V) and write 


T = T* 1 ’”’* r ■ J f,. 8 • • • 8 fi r 8 f 1 8 • • • 8 f 3e - 


Define 




Then 


._ 3 ukm 


(g tt )l r = T^""’\r 3a r fix® ■■■ ® /i r 8 f ja ® f n 8 • • • 8 Z^- 1 . 


Jlv>Ja-l,JajJa + l”->Js-l 
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Thus the (g 11 )^ visually seems to raise an index out of a-th place and puts it up 
in the last place above. Similarly, the component version of lowering (gt)“ takes 


and produces 


■ ,js ~~ T iajlf-,js 


How and why would one do this so called index raising and lowering? What 
motivates the choice of the slots? In practice one applies this type changing only 
in specific well motivated situations and the choice of slot placement is at least 
partially conventional. We will comment further when we actually apply these 
operators. The notation is suggestive and the g^ and g\, and their extensions 
are referred to as musical isomorphisms . One thing that is useful to know 
is that if we raise all the lower indices and lower all the upper ones on a tensor 
then we can “completely contract” against another one of the original type with 
the result being the scalar product. For example, let r = J2njf l ® and 
X = EXijP ® P ■ Then letting the components of (g 1 *) J o (g**)I(x) by we 
have 

x ij = g ik g jl xu 


In general, unless otherwise indicated, we will preform repeated index raising 
by raising from the first slot (g^)J o- • -o(gB)J and similarly for repeated lowering 
(gl>)J ° • • • ° (gb)j- For example, 

Aijki i— ► A i j k i = g ta A a j k i i— > A J kl = g ta gj b A a i,ki 
Exercise 13.2 Verify the above claim directly from the definition of (x, r). 

Even though elements of L^ lt (V) = /\ k (Y*) can be thought of as tensors of 
type 0, k that just happen to be anti-symmetric, it is better in most cases to 
give a scalar product to this space in such a way that the basis 

{e 11 /\--- r\e lk } il< ... <ik = {e 1 } 

is orthonormal if e 1 : 1 ...,e n is orthonormal. Now given any /c-form u> = a^e 1 
where e 1 = e 11 A • • • A e lk with i\ < ... < i^ as explained earlier, we can also 
write lo — haie 1 and then as a tensor 


W = ki aie 


~ k\ ail "' ike 
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Thus as a covariant tensor we have 

(oJ,Lo) — J^yj2 a ii---ik a 
= did 1 

and as a k-form we want the scalar product to give 

(to, w) = a fa 1 

1 i 

= M aia 

so the two definitions are different by a factor of hi. The definition for forms 
can be written succinctly as 

(a 1 A a 2 A • • • A a k , (3 1 A /3 2 A • • • A /3 k ) 

= det((a\ (3 3 )) 

where the a 1 and are 1-forms. 

Definition 13.7 We define the scalar product on /\ k V* = L k lt fiV) by first 
using the above formula for wedge products of 1-forms and then we extending 
(bi)linearly to all of f\ k V* . We can also extend to the whole Grassmann algebra 
A V* = ® A V* by declaring forms of different degree to be orthogonal. We 
also have the obvious similar definition for /\ k V and A V. 

We would now like to exhibit the definition of the very useful star operator. 
This will be a map from /\ k V* to V* for each k, 1 < k < n where 

n = dim(M). First of all if we have an orthonormal basis e 1 , ...., e n for V* then 
e 1 A • • • A e n € /\ n N*. But /\ n V* is one dimensional and if I : V* — > V* is 
any isometry of V* then le 1 A • • • A le n = ie 1 A • • • A e n . In particular, for any 
permutation a of the letters {1, 2, ..., n} we have e 1 A • • • A e n = sgn(a)e al A • • • A 
e an . 

For a given £* = {e 1 , ...,e”} for V* (with dual basis for orthonormal basis 
£ = {ei, ..., e n }) us denote e 1 A • • • A e n by e(£ *). Then we have 

(e(£*),e(£*)) = ei e 2 ••■£„ = ±1 

and the only elements u> of /\ n V* with (oj,u) = ±1 are e(£*) and —e{£*). 
Given a fixed orthonormal basis £* = {e 1 , ..., e n }, all other orthonormal bases 
B* bases for V* fall into two classes. Namely, those for which e(B*) = e(£*) 
and those for which e(B*) = —e(£*). Each of these two top forms ±e(£*) is 
called a metric volume element for V*, g* = (, ). A choice of orthonormal basis 
determines one of these two volume elements and we call this a choice of an 
orientation for V*. On the other hand, we have seen that any nonzero top 
form u> determines an orientation. If we have an orientation given by a top 
form u> then £ = {ei,...,e n } determines the same orientation if and only if 
w(ei, ..., e n ) > 0. 
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Definition 13.8 Let an orientation be chosen on V* and let £* = {e 1 ,...,e n } 
be an oriented orthonormal frame so that vol := e(£*) is the corresponding 
volume element. Then if T = {/ i, /„} is a basis for V with dual basis T* = 
{/\->/ n } then 

vol = \J | det (gij ) | ./ 1 A • • • A /” 

where g xj = (fi,fj). 

Proof. Let e* = a*-/-? then 

e t S ij = ± 5 « : (e\e?) = (a? k f k ,aU m ) 

= aiai n (f k ,r)=aiai n g km 

so that ±1 = det(a^) 2 det(g fcm ) = (det(a^.)) 2 (det(gij)) _1 and so 

Y |det(gij)| = det(Ofc). 

On the other hand, 

vol := e{£*) = e 1 A • • • A e n 

= </ fcl A • • • A a n k J k 1 = det(4)/ 1 A • • • A /" 

and the result follows. ■ 

Fix an orientation and let £* = {e 1 ,...,e n } be an orthonormal basis in 
that orientation class. Then we have chosen one of the two volume forms, say 
vol = £■(£*). Now we define * : /\ k V* — » /\ n k V* by first giving the definition 
on basis elements and then extending by linearity. 

Definition 13.9 Let V* be oriented and let {e 1 , ...,e n } be a positively oriented 
orthonormal basis. Let a be a permutation of (1, 2, ..., n). On the basis elements 
e'A 1 ) A • • • A e'A fc ) for /\ k V* define 

*{e a(1) A • • • A e CT(fe) ) = e CTl e CT2 • • • e <T(t s 5 n( ( j)e' T(fe+1) A • • • A e a{n) . 

In other words, 

*(e Jl A • • • A e lk ) = ±(ej 1 ej 2 • • • e ik )e jl A • • • A e jn ~ k 

where we take the + sign iff e n A • • • A e lk A e J1 A • ■ • A e 0 n-k = e 1 A • • • A e". 

Remark 13.3 In case the scalar product is positive definite ei = €2 ■■■ = e n = 1 
and so the formulas are a bit let cluttered. 

We may develop a formula for the star operator in terms of an arbitrary 
basis. 

Lemma 13.2 For a, /3 £ /\ k V* we have 


(a, j3)vol = a A *(3 
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Proof. It is enough to check this on typical basis elements e n A • • • A e lk 
and e mi A • • • A e mk . We have 

(e mi A • • • A e mk ) A *(e h A • • • A e ik ) (13.1) 

= e mi A • • • A e mfe A (±e h A • • • A e 7 "-' 1 ) 

This latter expression is zero unless {mi, ...,rrik} U {ji , j n -fc} = {1,2, ...,n} 
or in other words, unless {i i, ..., **,} = {mi, ...,?7ifc}. But this is also true for 

(e mi A • • • A e mk ,e il A ■ ■ ■ A e ik ) vol . (13.2) 

On the other hand if {i\,...,ik} = {mi ,...,mfc} then both 13.1 and 13.2 give 
± vol. So the lemma is proved up to a sign. We leave it to the reader to show 
that the definitions are such that the signs match. ■ 

Proposition 13.2 The following identities hold for the star operator: 

1) *1 = vol 

2) *vol = (-l) ind te) 

3) * * a = (-l) ind (s)(-l )Hn-k) a f or a e y*. 

Proof. (1) and (2) follow directly from the definitions. For (3) we must first 
compute *(e 7 ' fc+1 A • • • A e Jn ). We must have *(e- 7fc+1 A • • • A e^") = ce J1 A • • • A 
for some constant c. On the other hand, 

ceie 2 • • • vol = (e- 7fc+1 A • • • A e- 7 '" , ce- 71 A • • • A e- 7fc ) 

= (e 7fc+1 A"-Ae J ", *(e- 7fc+1 A • • • A e- 7 ")) 

= Ofe+i ' ' ' e Jn s fl n (jfc+i ’ --InJi ' ■■jk)e 3k+1 A • • • A e 7 " A e- 71 A • • • A 
Ofe+i ' ' ' On S5«0fe+i ' ' ' in , ji • • • ifc) vol 
= ( _l)*(n-fc) e . fc+i ... c . BV0l 

so that c = e Jfc+1 • • • (— l) fc ( n_fc ). Using this we have, for any permutation 

J {jl ! 

* * (e- 71 A • • • A e 3k ) = *ej x ej 2 ■ ■ ■ ej k sgn(J)e 3k+1 A • • • A e 7 " 

= O 1 O 2 • • ' OfcOfe+i ’ ’ ' e jn s 9 n (J)e J1 A • A e 7 ‘ 

= (-i) ind te)(_i) fc O- fc ) e ii A . . . a e 7 ‘ 

which implies the result. ■ 


13.2 Surface Theory 

Let S' be a submanifold of R 3 . The inverse of a coordinate map ip : V — » U C R 2 
is a parameterization x :Z7 — > V C S of a portion U of our surface. Let (iti, W 2 ) 
the coordinates of points in V. For example, the usual parameterization of the 
sphere 

x(<p, 6) = (cos 9 sin ip, sin 9 sin ip, cos ip) . 
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A curve on a surface S may be given by first letting t i— > , u 2 (t)) be a 

smooth curve into U and then composing with x :U — > S. For concreteness let 
the domain of the curve be the interval [a, b\. By the ordinary chain rule 

X = UidiX+U 2 d 2 X 


and so the length of such a curve is 


pb pb 

L= / \x(t)\dt= / \u\dix+u 2 d 2 x\ dt 

J a J a 

r-b 

= / C gijU\U2) 1/2 dt 
J a 


where g t j = diX-djX. Let p = x(ui,u 2 ) be arbitrary in V C S. The bilinear 
form g p given on each T p S C TR 3 where p = x(iti, u 2 ) given by 

g P (v,w) = gijV l w J 

for v p = v 1 dix.+v 2 d 2 x gives a tensor g is called the first fundamental form or 
metric tensor. The classical notation is ds 2 = ^ gijdujduj which does, whatever 
it’s shortcomings, succinctly encodes the first fundamental form. For example, 
if we parameterize the sphere S 2 C R 3 using the usual spherical coordinates p, 9 
we have 

ds 2 = dp 2 + sin 2 (ip)d6 2 

from which the length of a curve c(t) = x(ip(t), 9(t)) is given by 


i(c) = £ \/(f ) 2 + “VW(f) 2 <"- 

Now it may seem that we have something valid only in a single parameter- 
ization. Indeed the formulas are given using a single chart and so for instance 
the curve should not stray from the chart domain V. On the other hand, the 
expression g p (v,w) = gijV l w J is an invariant since it is just the length of the 
vector v as it sits in R 3 . So, as the reader has no doubt anticipated, gijv'w J 
would give the same answer now matter what chart we used. By breaking up a 
curve into segments each of which lies in some chart domain we may compute 
it’s length using a sequence of integrals of the form J (g^uxii^^dt. It is a sim- 
ple consequence of the chain rule that the result is independent of parameter 
changes. We also have a well defined notion of surface area of on S. This is 
given by 

Area(S) := f dS 

Js 


and where dS is given locally by \J g{v} ,u 2 )dv}du 2 where g := det(g,y). 
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We will need to be able to produce normal fields on S. In a coordinate patch 
we may define 


N = dix(ui,U 2 ) x <9 2 x(ui, it 2 ) 


= det 


dx 1 

dx 1 

du\ 

du 3 

dx 2 

dx 2 

dul 

dul 

dx 3 

dx 3 

_ du 1 

du 2 


j 

k 


The unit normal field is then n = N/ |7V|. Of course, n is defined independent 
of coordinates up to sign because there are only two possibilities for a normal 
direction on a surface in R 3 . The reader can easily prove that if the surface is 
orientable then we may choose a global normal field. If the surface is a closed 
submanifold (no boundary) then the two choices are characterized as inward 
and outward. 

We have two vector bundles associated with S that are of immediate interest. 
The first one is just the tangent bundle of S which is in this setting embedded 
into the tangent bundle of R 3 . The other is the normal bundle NS which has as 
its fiber at p £ S the span of either normal vector ±n at p. The fiber is denoted 
N p S. Our plan now is to take the obvious connection on TR 3 , restrict it to S and 
then decompose into tangent and normal parts. Restricting to the tangent and 
normal bundles appropriately, what we end up with is three connections. The 
obvious connection on R 3 is simply V{(X^ = i F*^) '■= dY l (£)gfr which exist 
simply because we have a global distinguished coordinate frame { -£~r } . The fact 
that this standard frame is orthonormal with respect to the dot product on R 3 
is of significance here. We have both of the following: 


1. V^(X • Y) = V tl • Y + X • VcF for any vector fields X and Y on R 3 and 
any tangent vector £. 

2. Vj o V„ = V v oVt (This means the connection has not “torsion” as we 
define the term later) . 


Now the connection on the tangent bundle of the surface is defined by pro- 
jection. Let £ be tangent to the surface at p and Y a tangent vector field on the 
surface. Then by definition 


= (v ? f) t 

where (VeF) T (p) is the projection of V^F onto the tangent planes to the surface 
at p. This gives us a map V : TS x 3C(M) — > X{M) which is easily seen to be 
a connection. Now there is the left over part (V^F) 3 - but as a map (£, Y) i— > 
(V^F) 3 - this does not give a connection. On the other hand, if 77 is a normal 
field, that is, a section of the normal bundle NS we define Vfr/ := (V^??) 3 -. 
The resulting map V : TS x T(S, NS) — > T(5, NS) given by (£, 77 ) 1 — > is 

indeed a connection on the normal bundle. Here again there is a left over part 
(V{7/) t . What about these two left over pieces (V^?y) T and (V^F) 3 -? These 
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pieces measure the way the surface bends in R 3 . We define the shape operator 
at a point p £ S with respect to a unit normal direction in the following way. 
First choose the unit normal field n in the chosen direction as we did above (lets 
say “outward” for concreteness). Now define S(p ) : T p S — > T p S by 

S{p)i = V 5 n. 

To see that the result is really tangent to the sphere just notice that n • n = 1 
and so V^n 


0 = £1 = £(n • n) 
= 2V^n • n 


which means that Vcn £ T p S. Thus the fact, that n had constant length gave 
us Vtn =(V^n) T and we have made contact with one of the two extra pieces. 
For a general normal section 77 we write p = fn for some smooth function on 
the surface and then 

(V^) T = (V^/n) T 

= (d/(£)n + /V^n) T 

- fS(p)£. 

so we obtain 

Lemma 13.3 S(p)£ = / _1 (V^/n) T 


n 



The next result tell us that S(p) : T p S — > T p S is symmetric with respect to 
the first fundamental form. 
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Lemma 13.4 Let v,w £ T p S. Then we have g p (S(p)v,w) = g p (v, S(p)w). 

Proof. The way we have stated the result hide something simple. Namely, 
tangent vector to the surface are also vectors in R 3 under the usual identification 
of TR 3 with R 3 . With this in mind the result is just S(p)v ■ w = v ■ S(p)w. Now 
this is easy to prove. Note that n-w = 0 and so 0 = v(n- w) = V„n-u’ + n- V„u>. 
But the same equation holds with v and w interchanged. Subtracting the two 
expressions gives 


0 = V„n ■ w + n • \7 v w 
- (Vu,n • v + n • V w v ) 

= V„n • w — V^n • v + n • (V„u> — \7 w v) 
= V„n • w — V^n • v 


from which the result follows. ■ 

Since S ( p ) is symmetric with respect to the dot product there are eigenvalues 
ki ,«2 an d eigenvectors v Kl ,v K2 such that v Ki ■ S(p)v K/j = SijK.i. Let us calculate 
in a special coordinate system containing our point p obtained by projecting 
onto the tangent plane there. Equivalently, we rigidly move the surface until 
p is at the origin of R 3 and is tangent to the x, y plane. Then the surface is 
parameterized near p by (u 1 , u 2 ) i— > (it 1 , it 2 , /(u 1 , u 2 )) for some smooth function 
/ with Jyr(O) = = 0. At the point p which is now the origin we have 

gij{ 0) = Sij. Since S is now the graph of the function / the tangent space 
T p S is identified with the x,y plane. A normal held is given by gradT 1 = 
grad (f(x,y) - z) = ( J^-, Jjfj, , -1) and the unit normal is 


n (u 1 , w 2 ) 



( df_ df_ 

du 1 ! du 2 ’ 


- 1 ) 


Letting r(u 1 ,u 2 ) := ((^fr) 2 + (J^p-) 2 + l) 1 ^ 2 and using lemma 13.3 we have 
S(p)£ = n) T = r~ x (\7 ^N) T where N := (J^-, , — 1). Now at the 

origin r = l and so £ • S(p)£ = V<W ■ £ = = d $ 0 u i € k € from which 

we get the following: 

(-s(p), = J2 f v aJ£r<°> 

valid for these special type of coordinates and only at the central point p. Notice 
that this means that once we have the surface positioned as a graph over the 
x, y-plane and parameterized as above then 

£• S(p)v = D 2 f(^v) at 0. 

Here we must interpret £ and v on the right hand side to be (^ 1 ,^ 2 ) and (u ,v 2 ) 
where as on the left hand side £ = £ 1 ^ + £ 2 J^,u = u 1 ^+v 2 ^ r . 
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Exercise 13.3 Position S to be tangent to the x. y plane as above. Let the x,z 
plane intersect S in a curve C\ and the y, z plane intersect S in a curve c 2 . Show 
that by rotating we can make the coordinate vectors g^r, g^a be eigenvectors for 
S(p) and that the curvatures of the two curves at the origin are k± and n 2 . 

We have two important invariants at any point p. The first is the Gauss 
curvature K := det(S) = Kik 2 and the second is the mean curvature H = 
\trace(S) = I(ki + k 2 ). 

The sign of H depends on which of the two normal directions we have cho- 
sen while the sign of n does not. In fact, the Gauss curvature turns out to 
be “intrinsic” to the surface in the sense that it remains constant under any 
deformation of the surface the preserves lengths of curves. More on this below 
but first let us establish a geometric meaning for H. First of all, we may vary 
the point p and then S becomes a function of p and the same for H (and K). 

Theorem 13.1 Let St be a family of surfaces given as the image of maps ht : 
S — » R 3 and given by p i— > p+ tv where v is a section of TR 3 ^ with v(0) = 1 
and compact support. Then 

-f- area(St) = — [ (v • Hn)dS 

dt t = o Js 

More generally, the formula is true if h : (— e, e) x S' — » R 3 is a smooth map and 

v Cp) : = JsUoMt.P)- 

Exercise 13.4 Prove the above theorem by first assuming that v has support 
inside a chart domain and then use a partition of unity argument to get the 
general case. 

Surface S is called a minimal surface if H = 0 on S. It follows from 
theorem 13.1 that if St is a family of surfaces given as in the theorem that if So 
is a minimal surface then 0 is a critical point of the function aft) := area(S t ). 
Conversely, if 0 is a critical point for all such variations of S then S' is a minimal 
surface. 

Exercise 13.5 Show that Sherk's surface, which is given by e z cos (y) = cos a:, 
is a minimal surface. If you haven't seen this surface do a plot of it using Maple 
or some other graphing software. Do the same for The helicoid ytanz = x. 

Now we move on to the Gauss curvature K . Here the most important fact is 
that K may be written in terms of the first fundamental form. The significance 
of this is that if S\ and S 2 are two surfaces and if there is a map (f : Si —> S 2 
which preserves the length of curves, then n Sl and k S2 are the same in the sense 
that K Sl = K S2 o (f>. In the following theorem, LL g., /} = 5ij to first order” means 
that gij { 0) = Sij and (0) = (0) = 0. 
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Theorem 13.2 (Gauss’s Theorema Egregium) Let p £ S. There always 
exist coordinates u,v centered at p (so u(p) = 0 ,v(p) = 0) such that gij = Sij to 
first order at 0 and for which we have 


K(p) 


d 2 gi2 

dudv 


(0) 


1 d 2 g 2 


2 du 2 


-( 0 ) 


1 & 2 gil 

2 dv 2 


( 0 ). 


Proof. In the coordinates described above which give the parameterization 
(it, v) i — * (it, v, f(u , v)) where p is the origin of R 3 we have 


9u(u,v) 312(11,11) 

321 (u, v) 322 (it, It) 


1 , ( df)2 dfdf 

1 ~r ( dx > dx dy 

did! 1 , 

dx dy 1 \dy> 


from which we find after a bit of straight forward calculation 




= det S(p) = K ( p ) . 


Note that if we have any other coordinate system s,t centered at p then 
writing (u, v) = (a; 1 ,® 2 ) and (s,<) = (ir , ar) we have the transformation law 

dx k dx 1 
9ij = 9 kl d¥d& 

which means that if we know the metric components in any coordinate system 
then we can get them, and hence K{p ), at any point in any coordinate system. 
The conclusion is the that the metric determines the Gauss curvature. We say 
that K is an intrinsic invariant. 


13.3 Riemannian and semi-Riemannian Metrics 

Consider a regular submanifold M of a Euclidean space, say R”. Since we iden- 
tify T. p M as a subspace of T p R™ = R" and the notion of length of tangent 
vectors makes sense on R ra it also makes sense for vectors in T p M. In fact, if 
X p ,Y p G T p M and ci,c 2 are some curves with Ci( 0 ) = X p , 02(0) = Y p then 
Ci and c 2 are also a curves in R". Thus we have an inner product defined 
g p (X p ,Y p ) = ( X p ,Y p ). For a manifold that is not given as submanifold of some 
R n we must have an inner product assigned to each tangent space as part of an 
extra structure. The assignment of a nondegenerate symmetric bilinear form 
g p £ T p M for every p in a smooth way defines a tensor field g £ X 2 ( M j on M 
called metric tensor. 

Definition 13.10 If g £ X 3 (M) is nondegenerate, symmetric and positive def- 
inite at every tangent space we call g a Riemannian metric (tensor). If g is 
a Riemannian metric then we call the pair M, g a Riemannian manifold . 
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The Riemannian manifold as we have defined it is the notion that best 
generalizes to manifolds the metric notions from surfaces such as arc length of 
a curve, area (or volume), curvature and so on. But because of the structure of 
spacetime as expressed by general relativity we need to allow the metric to be 
indefinite. In this case, some vectors might have negate or zero length. 

Recall the index of a bilinear form is the number of negative ones appearing 
in the signature. 

Definition 13.11 If g £ 3f!j ( M ) is symmetric nondegenerate and has con- 
stant index on M then we call g a semi- Riemannian metric and M, g a semi- 
Riemannian manifold or pseudo- Riemannian manifold. The index is 
called the index of M, g and denoted ind(M). The signature is also constant 
and so the manifold has a signature also. If the index of a semi- Riemannian 
manifold (with dim(M) > 2) is (— 1,+1,+1 + 1, ...) (or according to some con- 
ventions (1,— 1,— 1 — !,■■■)) then the manifold is called a Lorentz manifold 


The simplest family of semi-Riemannian manifolds are the spaces R" which 
are the Euclidean spaces R™ endowed with the scalar products given by 

v n 

{x,y) v = ~^2x l y l + X V- 

i= 1 i=is- 1-1 


Since ordinary Euclidean geometry does not use indefinite scalar products we 
shall call the spaces R" pseudo-Euclidean spaces when the index v is not zero. 
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If we write just R" then either we are not concerned with a scalar product at 
all or the scalar product is assumed to be the usual inner product {v = 0).Thus 
a Riemannian metric is just the special case of index 0. 

Definition 13.12 Let M, g and N. li be two semi- Riemannian manifolds. A 
diffeomorphism '!> : M — > N is called an isometry z/<J>*h = g. Thus for an 
isometry $ : M — > N we have g (v,w) = h (T<3> • v,T< f> • w) for all v,w £ TM. 
If $ : M — > N is a local diffeomorphism such that <I>*h = g is called a local 
isometry. 

Example 13.1 We have seen that a regular submanifold of a Euclidean space 
R™ is a Riemannian manifold with the metric inherited from R" . In particular, 
the sphere S ” _1 C R" is a Riemannian manifold. Every isometry of .S'" -1 is the 
restriction to S' 11-1 of an isometry o/R" that fixed the origin (and consequently 
fixes S'" -1 / 

One way to get a variety of examples of semi-Riemannian manifolds is via 
a group action by isometries. Let us here consider the case of a discrete group 
that acts smoothly, properly, freely and by isometries. We have already seen 
that if we have an action p : G x M — > M satisfying the first three conditions 
then the quotient space M/G has a unique structure as a smooth manifold 
such that the projection k : M —> M /G is a covering. Now since G acts by 
isometries p*(., .) = (., .) for all g € G. The tangent map Tn : TM — » T(M/G) 
is onto and so for any v K ( p ) £ T K ^(M/G) there is a vector v p £ T p M with 
T p K.Vp = v K (p). In fact there is more than one such vector in T p M (except in 
the trivial case G = {e}) but if T p n.v p = T q n.w q then there is a g £ G such that 
p g p = q and T p pgV p = w q . Conversely, if p g p = q then T p n.{T p p g v p ) = T q n.w q . 
Now for vi,v 2 € T p M define h(v\,v 2 ) = (vi,v 2 ) where v\ and V 2 are chosen so 
that Tn.Vi = Vi- From our observations above this is well defined. Indeed, if 
T p K.Vi = T q K.Wi = Vi then there is an isometry p g with p g p = q and T p p g Vi = Wi 
and so 

(vi,v 2 ) = (T p pgVi, T p p g v 2 ) = (wi,w 2 ). 

It is easy to show that x i— > h x defined a metric on M / G with the same signature 
as that of (., .) and K*h = (., .). In fact we will use the same notation for either 
the metric on M/G or on M which is not such an act of violence since k is now 
a local isometry. The simplest example of this construction is R"/T for some 
lattice T and where we use the canonical Riemannian metric on R". In case 
the lattice is isomorphic to Z" then R n /T is called a flat torus of dimension n. 
Now each of these tori are locally isometric but may not be globally so. To be 
more precise, suppose that fi, f 2 , ...,/„ is a basis for R" which is not necessarily 
orthonormal. Let T f be the lattice consisting of integer linear combinations of 
/i, f 2 , ..., /„. The question now is what if we have two such lattices Tf and 
T j when is R”/T f isometric to R”/rj? Now it may seem that since these are 
clearly diffeomorphic and since they are locally isometric then they must be 
(globally) isometric. But this is not the case. We will be able to give a good 
reason for this shortly but for now we let the reader puzzle over this. 
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Every smooth manifold that admits partitions of unity also admits at least 
one (in fact infinitely may) Riemannian metrics. This includes all finite di- 
mensional paracompact manifolds. The reason for this is that the set of all 
Riemannian metric tensors is, in an appropriate sense, convex. To wit: 

Proposition 13.3 Every smooth manifold admits a Riemannian metric. 

Proof. As in the proof of 13.3 above we can transfer the Euclidean metric 
onto the domain U a of any given chart via the chart map if a . The trick is to 
piece these together in a smooth way. For that we take a smooth partition of 
unity U a ,p a subordinate to a cover by charts U a ,ijj a . Let g a be any metric on 
U a and define 

g (P) = '^2p a (p)9a(p)- 

The sum is finite at each p £ M since the partition of unity is locally finite and 
the functions p a g a are extended to be zero outside of the corresponding U a . 
The fact that p a > 0 and p a > 0 at p for at least one a easily gives the result 
that g positive definite is a Riemannian metric on M. m 

The length of a tangent vector X p £ T p M in a Riemannian manifold is given 
by yjg (X p , X p ) = yj (X p , X p ). In the case of an indefinite metric (v > 0) we 
will need a classification: 

Definition 13.13 A tangent vector v £ T p M to a semi- Riemannian manifold 
M is called 

1. spacelike if (u, v) > 0 

2. lightlike or null if {v, v) = 0 

3. timelike if (v, v) < 0. 

Definition 13.14 The set of all timelike vectors T p AI in is called the light 
cone at p. 

Definition 13.15 Let I C K. be some interval. A curve c : I — > M, g is called 
spacelike, lightlike, or timelike according as c(t) £ T C ^M is spacelike, lightlike, 
or timelike respectively for all t £ I. 

For Lorentz spaces, that is for semi-Riemannian manifolds with index equal 
to 1 and dimension greater than or equal to 2, we may also classify subspaces 
into three categories: 

Definition 13.16 Let M, g be a Lorentz manifold. A subspace W C T p M of 
the tangents space is called 

1. spacelike if g| w is positive definite, 

2. time like if g| w nondegenerate with index 1 , 
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3. lightlike if g| w is degenerate. 

Remark 13.4 (Notation) We will usually write (X p ,Y p ) or g(X p ,Y p ) in place 
of g(p)(X p , X p ). Also, just as for any tensor field we define the function (X,Y) 
which for a pair of vector fields is given by ( X , Y)(p) = ( X p , Y p ). 

In local coordinates (a: 1 , x n ) on U C M we have that g|[/ = X] g l3 dx l CZidx :i 
where g i? = (gfi, gfy). Thus if X = £) -X -1 gfr and Y = Y) on U then 

(X,Y) =Y, gij X iYi (13.3) 

Remark 13.5 The expression (X,Y) — X^g ijX l Y l means that for all p £ U 
we have (X(p),Y(p)) = X^S ij(p)X l (jp)Y l (p) where as we know that functions 
X 1 and Y l are given by X 1 = dx z (X) and Y l = dx l (Y). 

Recall that a continuous curve c : [a,b] —>■ M into a smooth manifold is 
called piecewise smooth if there exists a partition a = to < t\ < ■ ■ ■ < tk = b 
such that c restricted to [i», i»+i] is smooth for 0 < i < k — 1. Also, a curve 
c : [a, b] — > M is called regular if it has a nonzero tangent for all t £ [a, b\. 

Definition 13.17 Let M, g be Riemannian. If c : [a, b\ — > M is a (piecewise 
smooth) curve then the length of the curve from c(a) to c(b) is defined by 

length (c) = f (c(t), c(t)) 1 ^ 2 dt. (13.4) 

c(a)—>c(b) Ja 

Definition 13.18 Let M, g be semi- Riemannian. If c : [ a,b } — > M is a (piece- 
wise smooth) timelike or spacelike curve then 

rb 

T c (a),c(b)(c) = / \(c(t),d(t))\ 1/2 dt 

J a 

is called the length of the curve. 

In general, if we wish to have a positive real number for a length then 
in the semi-Riemannian case we need to include absolute value signs in the 
definition so the proper time is just the timelike special case of a generalized 
arc length defined for any smooth curve by / Q b \{c(t),c(t))\ 1 ^ 2 dt but unless the 
curve is either timelike or spacelike this arc length can have some properties that 
are decidedly not like our ordinary notion of length. In particular, curve may 
connect two different points and the generalized arc length might still be zero! 
It becomes clear that we are not going to be able to define a metric distance 
function as we soon will for the Riemannian case. 

Definition 13.19 A positive reparameterization of a piecewise smooth curve 
c : / — ► M is a curve defined by composition c o / -1 : J — > M where f : I — > J 
is a piecewise smooth bijection that has f > 0 on each subinterval C I 

where c is smooth. 
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Remark 13.6 (important fact) The integrals above are well defined since 
d(t) is defined except for a finite number of points in [a, b}. Also, it is important 
to notice that by standard change of variable arguments a positive reparameteri- 
zation c(u) = c(/ _1 (w)) where u = /(f) does not change the ( generalized ) length 
of the curve 

/ \(c{t),c{t))\ 1/2 dt= / \(d(u),d(u))\ 1/2 du. 

Ja Jf-Ra) 

Thus the (generalized) length of a piecewise smooth curve is a geometric property 
of the curve; i.e. a semi-Riemannian invariant. 

13.4 The Riemannian case (positive definite met- 
ric) 

Once we have a notion of the length of a curve we can then define a distance 
function (metric in the sense of “metric space”) as follow. Let p,q G M. Con- 
sider the set path(p, q) of all smooth curves which begin at p and end at q. We 
define 


dist(p, q) = inf {Z eR:I = length(c) and c G path(p, q)}. (13.5) 

or a general manifold just because dist(p, q) = r does not necessarily mean 
that there must be a curve connecting p to q having length r. To see this just 
consider the points (—1, 0) and (1, 0) on the punctured plane R 2 — 0. 

Theorem 13.3 (distance topology) Given a Riemannian manifold, define 
the distance function dist as above. Then M, dist is a metric space and the 
induced topology coincides with the manifold topology on M . 

Proof. That dist is true distance function (metric) we must show that 

(1) dist is symmetric, 

(2) dist satisfies the triangle inequality, 

(3) dist(p, q) > 0 and 

(4) dist(p, q) = 0 iff p = q. 

Now (1) is obvious and (2) and (3) are clear from the properties of the 
integral and the metric tensor. To prove (4) we need only show that if p yf q then 
dist(p, q) > 0. Choose a chart if a , U a containing p but not q ( M is Hausdorff). 
Now since il) a (U a ) C R" we can transfer the Euclidean distance to U a and define 
a small Euclidean ball Be U c{Pi r ) i n this chart. Now any path from p to q must 
hit the boundary sphere S(r) = dBEucipA)- Now by compactness of Beuc(pt) 
we see that there are constants Co and C\ such that Cj S t j > gij(x) > C 0 S,j 
for all x G Beuc{p,t) . Now any piecewise smooth curve c : [a, b] — > M from 
p to q hits S(r ) at some parameter value b\ < b where we may assume this is 
the first hit ( i.e. c(f) G Be U c{pt) for a < t < b 0 ). Now there is a curve that 
goes directly from p to q with respect to the Euclidean distance; i.e. a radial 
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curve in the given Euclidean coordinates. This curve is given in coordinates as 
S Pt q(t ) = — t)x(p) + gt l —{t — a)x(q). Thus we have 


length(c) > 


bo d(x i o c) d(x j o c)\ 1/2 ^ i /2 f bo ( d(x i o c) \ 1/2 


S ij ' 


dt 


dt 


dt > Co 


dt 


dt 


Cl' 2 / ° |c'(t)| dt > cl' 2 f° \5' p>q (t)\dt = cl /2 r. 

J a J a 


1 /2 

Thus we have that distfp. q) = inf{length(c) : c a curve from p to </} > C 0 r > 
0. This last argument also shows that if dist(p, x) < C 0 r then x € Be„ c (p, r). 
This means that if B{p, Cl' 2 r) is a ball with respect to dist then B (p, Cl' 2 r) C 
Beuc{pn)- Conversely, if x £ Be U c(p,x) then letting S p>x a “direct curve” 
analogous to the one above that connects p to x we have 


dist(p, x) < length(^ PiX ) 

rb o 


< 


dt 

Cl' 2 [ b °KM^Cl' 2 r 

J a 


so we conclude that B Eu c{p , r) C B(p, Cl' 2 r). Now we have that inside a chart, 
every dist-ball contains a Euclidean ball and visa vera. Thus since the manifold 
topology is generated by open subsets of charts we see that the two topologies 
coincide as promised. ■ 


13.5 Levi-Civita Connection 

In the case of the semi-Riemannian spaces R" one can identify vector fields with 
maps X:R” — > R" and thus it makes sense to differentiate a vector held just as 
we would a function. For instance, if X= (f 1 ,...,f n ) then we can define the 
directional derivative in the direction of v at p £ T p R" = R" by V„X =(D p f 1 ■ 
v, ..., D p f n ■ v) and we get a vector in T p R" as an answer. Taking the derivative 
of a vector held seems to require involve the limit of difference quotient of the 
type 

H m X ^ p+tv ^~ X ^ 

t-> o t 

and yet how can we interpret this in a way that makes sense for a vector held 
on a general manifold? One problem is that p + tv makes no sense if the 
manifold isn’t a vector space. This problem is easily solve by replacing p + tv 
by c(t) where c(0) = v and c(0) = p. We still have the more serious problem 
that X(c(t)) € T c ( t )M while X(p) = X(c(0)) € T p M . The difficulty is that 
T c ( t ) M is not likely to be the same vector space as T p M and so what sense does 
X(c(t)) — X(p) make? In the case of a vector space (like R") every tangent 
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space is canonically isomorphic to the vector space itself so there is sense to be 
made of a difference quotient involving vectors from different tangent spaces. In 
order to get an idea of how we might define V„X on a general manifold, let us 
look again at the case of a submanifold M of R n . Let X £ X(M) and v £ T P M. 
Form a curve with c(0) = v and c(0) = p and consider the composition X o c. 
Since every vector tangent to M is also a vector in R" we can consider X o c to 
take values in R" and then take the derivative 


This is well defined but while X o c(t ) £ T C ^M C T c ( t )R" we only know that 
o c £ T p W n . A good answer should have been in T p M. The simple 
solution is to take the orthogonal projection of j| 0 Xoc onto T C ( 0 )M. Our 
tentative definition is then 


V„X := 


d_ 

dt 


Xoc 

o 




G T P M. 


This turns out to be a very good definition since it turns out that we have the 
following nice results: 

1. (Smoothness) If X and Y are smooth vector fields then the map 


P ^ Vx p T 

is also a smooth vector field on M. This vector filed is denoted 

2. (Linearity over R in second “slot” ) For two vector fields X and Y and any 
a, b £ R we have 


X v (ctXi T bX 2 ) — uX v Xi T bX v X 2 . 

3. (Linearity over C°°(M) in first “slot”)For any three vector fields X, Y 
and Z and any f,g £ C°°(M) we have 

V fx+gyZ = /VxF + gXyZ. 

4. (Product rule) For v £ T p M , X £ X(M) and / £ C°°{M ) we have 

V v fX = f(p)X v X + (vf)X(p) 

= f(p)X v X + df(v)X(p). 

Or in terms of two fields X, Y 

X x fY = fX x Y + (. Xf)Y . 

5. V v (X, Y) = (V v X, Y) + (A, V t ,Y) for all v, X , Y. 
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Now if one takes the approach of abstracting these properties with the aim 
of defining a so called covariant derivative it is a bit unclear whether we should 
define V xY for a pair of fields X, Y or define X V X for a tangent vector v and a 
field X. It turns out that one can take either approach and when done properly 
we end up with equivalent notions. We shall make the following our basic 
definition of a covariant derivative. 

Definition 13.20 A natural covariant derivative (or connection 1 ) V on 

a smooth manifold M is an assignment to each open set U C M of a map 
X u : X(U) x X(U) — * X(U) written X u : ( X , Y) — » V y Y such that the following 
hold: 

1. X x Y is C°°(U) -linear in X, 

2. Vy Y is M.-linear in Y, 

3. X l x (fY) = fX u x Y + (Xf)Y for all I,Y € X(U) and all f € C°°(U). 

4. IfVcU then ?’y(VyF) = X^ Ux ryY (naturality with respect to restric- 
tions). 

5. (Vy Y)(p) only depends of the value of X at p (infinitesimal locality). 

Here Vy- Y is called the covariant derivative ofY with respect to X . We will 
denote all of the maps X u by the single symbol V when there is no chance of 
confusion. 

Definition 13.21 If M is endowed with a semi-Riemannian metric g = (.,.) 
then a connection V on M is called a metric covariant derivative (for g) 

MC X(Y, Z) = (VxY, Z) + (Y, V X Z) 
for all X,Y,Z € X(U) and all UcM 

We have worked in naturality with respect to restriction and infinitesimal 
locality in order to avoid a discussion of the technicalities of localization and 
globalization on infinite dimensional manifolds. If the connection arises from a 
spray or system of Christoffel symbols as defined below then these conditions 
would follow automatically from the rest (see the proposition below). We also 
have the following intermediate result. 

1 It would be better if we could avoid the term connection at this point and use covariant 
derivative instead since later we will encounter another definition of a connection which refers 
to a certain “horizontal” subbundle of the tangent bundle of the total space of a vector 
bundle (or also of a principal bundle). Connections in this sense determine a natural covariant 
derivative. Conversely, a natural covariant derivative determines a horizontal subbundle, i.e. 
a connection in this second (more proper?) sense. Thus a natural covariant derivative and a 
connection are mutually determining. 
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Lemma 13.5 Suppose that M admits cut-off functions and V M : X(M) x 
X(M) — + X(M) is such that (1), (2) and (3) hold (for U = M). Then if 
on some open U either X = 0 orY = 0 then 

(VxY)(p) = 0 for all p € U. 

Proof. We prove the case of Y\ v = 0 and leave the case of X\ v = 0 to the 
reader. 

Let q £ U. Then there is some function / which is identically one on a 
neighborhood V C U of q and which is zero outside of U thus fY = 0 on M 
and so since V M is linear we have V M (fY ) = 0 on M. Thus since (3) holds for 
global fields we have 

V M (/T)(<?) = f(p)(V%Y){q) + (. X q f)Y q 
= (VfF)(g) = 0. 

Since q £ U was arbitrary we have the result. ■ 

In the case of finite dimensional manifolds we have 

Proposition 13.4 Let M be a finite dimensional smooth manifold. Suppose 
that there exist an operator V M : X(M) x X(M) — > X(M) such that (1), (2) and 
(3) hold (for U = M). Then if we set := r^fV^Y) for any extensions of 

X and Y £ X(U) to global fields X and Y £ X(M ) then U i— > X u is a natural 
covariant derivative. 

Proof. By the previous lemma Vyb := r^f (V^Y) is a well defined oper- 
ator which is easily checked to satisfy (1),(2) , (3) and (4) of definition 13.20. 
We now prove property (5). Let a £ T*AI and fix Y £ X(U). define a map 
X(U) — > C°°(U) byTi-4 a(V£y). By theorem 7.2 we see that a(V^-P) depend 
only on the value of X at p £ U. m 

Since many authors only consider finite dimensional manifolds they define a 
covariant derivative to be a map V M : X(M) x X(M) — » X(M) satisfying (1), 
(2) and (3). Later we will study connections and we show how these give rise 
to natural covariant derivatives. 

It is common to write expressions like V _e_X where X is a global field 

dx z 

and is defined only on a coordinate domain U. This still makes sense 
as a field p i— > V u (n] X on U by virtue of (5) or by interpreting V a X as 

V g X\ v and invoking (4) if necessary. Let us agree to call a map V M :: 
X(M) x X(M) — > X(M) such that (1), (2) and (3) hold for U = M a globally 
defined covariant derivative on X(M). We now introduce the notion of a system 
of Christoffel symbols. We show below that if a globally defined covariant 
derivative V M on X(M) is induced by a system of Christoffel symbols then 
defining Vyb := r^j (V-fP) as in the finite dimensional case gives a natural 
covariant derivative. 
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Definition 13.22 A system of Christoff el symbols on a smooth manifold 
M (modelled on M ) is an assignment of a differentiable map 


to every admissible chart. U a ,il) a such that if U a ,xl> a and Up,il >/3 are two such 
charts with U a C\Up 0, then for all p G U a OUp 

£) (#°^a 1 ) -r Q (V) 

= £> 2 0/3 ° C 1 ) + T is(y) ° {D{ipp o ip- 1 ) x d(v>/3 ° c 1 )) 


where y = ipp(p ) and x = ip a (p). For finite dimensional manifolds with ip a = 
(x 1 , ...,x n ) and ipp = (y 1 , ...,y n ) this last condition reads 


d]f_ 

dx k 




d 2 y T f x)+ fr (y( X ))W( X )W, x) 
dx'dxi { ’ + P^ x >> dx i [ ’ dxi [ } 


where r^.(x) are the components of r a (x) andT r pq the components ofTp(y(x)) = 
Tpfipp ° if- 1 (x)) with respect to the standard basis of L(M. n , R"; K"). 


Proposition 13.5 Given a system of Christoffel symbols on a smooth mani- 
fold M there is a unique natural covariant derivative V on M such that the 
principal part ofS7 x Y with respect to a chart. U a ,ip a is given by DY(x).X(x) + 
r a (x)(X(x),Y(x)) for x G tfaO-Ja). Conversely, a natural covariant, derivative 
determines a system of Christoffel symbols. 


Proof. Let a system of Christoffel symbols be given. Now for any open set 
U C M we may let {U a ,ip a } a be any family of charts such that (J Q U a = U. 
Given vector fields X, Y £ 3C(U) we define 

sx,y{Uo) ~ V \x r uj 

to have principal representation 

O' Ot Ot cx.cn 

V^Y = DY-X + r a (X,Y). 
x 

It is straight forward to check that the change of chart formula for Christoffel 
symbols implies that 

r ulnu b s xx(U a ) = sx,y{U a n Uf) = r jj b a (- l jj b s x , y ( fJ a ) 
and so by sheaf theoretic arguments there is a unique section 

Vxb G X(U) 


such that 


r%y x Y = s x , Y (U a ) 
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The verification that this defines a natural covariant derivative is now a straight- 
forward (but tedious) verification of (l)-(5) in the definition of a natural covari- 
ant derivative. 

For the converse, suppose that V is a natural covariant derivative on M. 
Define the Christoffel symbol for a chart U a , if a to be in the following way. For 
fields 

x-> (x, X(x)) 

and 

x — > (x, Y(x)) 

a. a cx a at 

one may define 0(X,Y) := V«Y — DY • X and then use the properties (l)-(5) 

x 

at ex. ex ot 

to show that 0(X,Y)(x) depends only on the values of X and Y at the point 

at ot 

x. Thus there is a function T : U a — > L(M,M;M) such that 0(X, Y)(x) = 

Ot Ot 

T(x)(X(x), Y(x)). We wish to show that this defines a system of Christoffel 
symbols. But this is just an application of the chain rule. ■ 

In finite dimensions and using traditional notation 

/ f)Y k \ f) 

where X = X J and Y = Y^ g|j . In particular, 

V — =T k - — 
afr dxi u dx k 

Proof. The proof follows directly from the local definition and may be easily 
checked by the patient reader. One should check that the transformation law 
in the definition of a system of Christoffel symbols implies that Z?Y(x).X(x) + 
T q (x)(X(x),Y(x)) transforms as the principal local representative of a vector. 


Remark 13.7 We will eventually define an extension of the covariant deriva- 
tive to tensor fields. Let us get a small head start on that by letting V.y/ := Xf 
for f £ C°° (M) . We now have the following three expression for the same thing: 

Vxf := Xf := C x f. 

Notice that with this definition (4) above reads more like a product rule: 

Vx(Y, Z) = (V X Y, Z) + (Y, V X Z). 

Definition 13.23 Define the operator Ty : X(M) x X(M) — > X(M) by 


T{X, Y) = V X Y - X Y X - [X, Y\. 
Ty is called the torsion tensor for the connection V. 
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Theorem 13.4 For a given Riemannian manifold M, g, there is a unique met- 
ric connection V such that its torsion is zero; Tv = 0. This unique connection 
is called the Levi-Civita derivative for M, g. 

Proof. We will derive a formula that must be satisfied by V which can in 
fact be used to define V. Let X, Y, Z, W be arbitrary vector fields on U C M. 
If V exists as stated then on U we must have 

X(Y, Z) = {V X Y, Z) + (Y, V X Z) 

Y{Z , X) = {V Y Z, X) + {Z, Vyl) 

Z(X, Y) = (VzX, Y) + (X, XzY). 

where we have written V f simply as V. Now add the first two equations to the 
third one to get 

X(Y, Z) + Y(Z, X) - Z(X , Y) 

= (Va-T, z) + (Y, S7xZ) + ( VyZ , X) + (Z, Vyl) 

-<v z x, Y)-{X,VzY). 

Now if we assume the torsion zero hypothesis then this reduces to 

X{Y, Z) + Y{Z, X) - Z{X, Y) 

= (Y,[X,Z]) + (X, %Z}) 

~(Z, [X,Y\) + 2(V x Y,Z). 

Solving we see that Va Y must satisfy 

2(Va y z) = x(y z) + y(z, x) - z<x, y) 

(z, [x,y])-<y,[x,z])-<x, [y,z]). 

Now since knowing (Va 'Y, Z) for all Z is tantamount to knowing S7 xY we 
conclude that if V exists then it is unique. On the other hand, the patient 
reader can check that if we actually define (VxY, Z) and hence V xY by this 
equation then all of the defining properties of a connection are satisfied and 
furthermore Ty will be zero. ■ 

It is not difficult to check that we may define a system of Christoffel symbols 
for the Levi Civita derivative by the formula 

T a (X,Y) := V X Y — DY ■ X 

where X, Y and VxY are the principal representatives of X, Y and VxY respec- 
tively for a given chart U a ,ip a . 

Proposition 13.6 Let M be a semi- Riemannian manifold of dimension n and 
let U,ij) = (x 1 , ...,x n ) be a chart. Then we have the formula 

r k = 1 ki dgtj \ 

* 7 2 ” \ dx i dx-i dx l ) ' 

where gj k g kl = 5], 
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13.6 Covariant differentiation of vector fields along 
maps. 

Let F : N — > M be a smooth map. A vector field along F is a map Z : N — > 
TM such that the following diagram commutes: 

TM 

Z / |™ 

N -C M 

We denote the set of all smooth vector fields along a map F by Xp . Let 
F : N — » M be a smooth map and let the model spaces of M and N be M and 
N respectively. 

We shall say that a pair of charts Let U, ip be a chart on M and let V, <p be 
a chart on N such that (p(V) C U. We shall say that such a pair of charts is 
adapted to the map F. 

Assume that there exists a system of Christoffel symbols on M. We may 
define a covariant derivative V x Z of a vector field Z along F with respect to 
a held X £ X(N) by giving its principal representation with respect to any 
pair of charts adapted to F. Then V xZ will itself be a vector fields along F. 

The map F has a local representation Fyjj : V — > ip(U) defined by Fyjj := 
‘ijjoFoip— 1 . Similarly the principal representation Y* : f(V) — > M of Y is given 
by Tipo Zoip~ x followed by projection onto the second factor of ip(U) x M. Now 
given any vector held X £ X(N) with principal representation X : ip(U) — > N 
we define the covariant derivative X xY* of X with respect to Z as that vector 
held along F whose principal representation with respect to any arbitrary pair 
of charts adapted to F is 

DZ(x) ■ X(x) + T(F(x))(DF v ,u(x) ■ X(x), Z(x)). 

The resulting map V : X(N) x Xf — > X(N) has the following properties: 

1. V : X(N) x Xp — > X(N) is C°°(N) linear in the hrst argument. 

2. For the second argument we have 

Xx(fZ) = fX x Z + X(f)Z 

for all / e C°°(N). 

3. If Z happens to be of the form Y o F for some Y £ X(M) then we have 

V x (YoF) = (Vtf-xY)oF. 

4. (X7xZ)(p) depends only on the value of X at p £ N and we write (VxZ)(p) = 
V.y P Z. 
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For a curve c :: M. — and Z :: K. — > TM we define 

XZ 

■= V d/dtZ G £f 

If Z happens to be of the form Y o c then we have the following alternative 
notations with varying degrees of precision: 

VF 

V d/dt(Y ° c) = Vcp)F = X d / dt Y = -jjr 

13.7 Covariant differentiation of tensor fields 

Let V be a natural covariant derivative on M. It is a consequence of proposition 
9.4 that for each X G X(U) there is a unique tensor derivation V.v on T r s {U) 
such that Vjc commutes with contraction and coincides with the given covariant 
derivative on £(£/) (also denoted Vx) and with Cxf on C°°(U). 

To describe the covariant derivative on tensors more explicitly consider T G 
with a 1-form Since we have the contraction F® Y i— > C(F® Y) = Y(F) we 
should have 


VxT(F) = VxC(y®T) 

= C(V*(F®Y)) 

= C(V X Y ® T + Y <g> V X T) 

= T(V a -F) + (V a -T)(F) 

and so we should define (Va'Y)(F) := Vx(Y(F)) — T(VxF). If T G ij then 

S 

(VxT )(Fi, ..., Y s ) = Va(T(Fi, Y a )) - £ T(..., V A -^, ...) 

i = 1 

Now if Y G we apply this to X Z G T\ and get 

(VxVZ) (F) = x(vz(Y)) - xzix x y) 

= Va :(VyZ) — Vv x yZ 

form which we get the following definition: 

Definition 13.24 The second covariant derivative of a vector field Z G T° is 
X 2 Z : (X,F) h- Vxy(Z) = V X (V Y Z) - V Vx yZ 


Definition 13.25 A tensor field Y is said to be parallel if X ^ Y = 0 for all f. 
Similarly, if a : I — > TJ (M) is a tensor field along a curve c : I — > M satisfies 
V g t a — 0 on I then we say that a is parallel along c. Just as in the case of 
a general connection on a vector bundle we then have a parallel transport map 

P{c)t 0 : T s(M) c ( to ) — > T I(M) c (ty 
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Exercise 13.6 Prove that 


V a cr(£) = Km 
€ — *0 


P(cY t+e a(t + e) 
e 


a{t) 


Also, if T G T r s then if c x is the curve 1 1 — > Flf(p) 

V\-T(p) = lim P ( c ^+ £ ( ToF ^(P))^ y °^ Y (P) 

€ — »o e 

The map : X£M — > X£M just defined commutes with contraction. This 
means for instance that 


V.(T*| t ) = V.T«| t . 

Furthermore, if the connection we are extending is the Levi Civita connection 
for semi-Riemannian manifold M, g then 

V^g = 0 for all £ 


To see this recall that 

V*(g <g> Y ® W) = Vcg ®I»y + g0 V 5 X <g> X + g <s> x ® V=X 

which upon contraction yields 

v c (g(x, y)) = (v ?g )(x, y) + g (v 5 x, x) + g (x, v 5 x) 
y) = (v ?g )(x, y) + (v ? x, y) + <x, v*x>. 

We see that Vjg = 0 for all £ if and only if (X, X) = (VjX, X) + (X, V$X) 
for all £, X, X. In other words the statement that the metric tensor is parallel 
(constant) with respect to V is the same as saying that the connection is a 
metric connection. Since we are assuming the connection has torsion zero metric 
connection means we have a Levi-Civita connection. On the other hand, for a 
fixed T G X( ( M ) the map X i— > Vx'L is C°°(M)— linear and so we may view 
V as being a map from X£(M) to X^(M)c°° 0 0}{M) = T(Hom(TM, L\ lt M)). 
We call VT G X((A/ ) (■•-*- 0 Q 1 (A/ j the covariant differential of T. 

13.8 Comparing the Differential Operators 

On a smooth manifold we have the Lie derivative : X£(M) — > X( ( M ) and 
the exterior derivative d : Q k (M) — » fi fc+1 (M) and in case we have a torsion 
free covariant derivative V then that make three differential operators which 
we would like to compare. To this end we restrict attention to purely covariant 
tensor fields X°(M). 

The extended map Vj : T° s ( M ) — » X°(M) respects the subspace consisting 
of alternating tensors and so we have a map 
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which combine to give a degree preserving map 

V, : L alt (M ) - L alt (M) 


or in other notation 

V ? : fi(M) -> fi(M). 

It is also easily seen that not only do we have Vc(a <g) (3) = V^a ® (3 + a (g> V^/3 
but also 

V^(a A 0) = Vja A (3 + a A V^/3 

Now as soon as one realizes that Vw £ Qr(M)c oa ® 0}(M) instead of 
Cl k+1 (M) we search for a way to fix things. By antisymetrizing we get a map 
0 k (M) — » f l k+1 (AI) which turns out to be none other than our old friend the 
exterior derivative as will be shown below. 

Now recall that X(co(Y)) = (V,Yw)(h) + w(VxF) for u> € f 2 1 (M) and X 
and Y vector fields. More generally, for T € T° s ( M j we have 

S 

(Vjc5)(y l9 ,Y S ) =X(S{Y 1 ,^Y s ))-J2S(Y 1 ,...,Y i _ ll V x Y i ,Y i+1 ,.., f Y s ) 

i = 1 

(13-6) 

and a similar formula for the Lie derivative: 


{C x S){Y u ... t Y s ) =X(5(F 1 ,...,y s ))-^5(F 1 ,,..,F i _ tj £^,^+ 1 ,...,n). 

i=l 

(13.7) 

On the other hand, V is torsion free and so CxYi = [X, Yi] = V x Yi — VyW 
and so we obtain 


S 


(£ X 5)(F 1 ,...,F S ) = (Vx5)(F 1 ,...,F s )+^S(y 1 ,..,F i _i,V >; X,F i+1 ,...,y s ). 


i = 1 

(13-8) 

When V is the Levi-Civita connection for the Riemannian manifold M, g we get 
the interesting formula 


(c xg )(Y, z ) = g (v A -y, z) + g (y, v x z) (13.9) 

for vector fields X,Y,Z £ X(M). 


Theorem 13.5 If V is any natural, torsion free covariant derivative on M 
then we have 


dco(X 0 ,X u ...,X k ) 


k 

E 

i = 0 


(— l) i (V Ai w)(X 0 , ..., Xi, ..., -X*;) 


Proof. Recall the formula 
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k 

dto{X o, X lt ...,X k ) = Xi, X k )) 

i= 0 

+ Y, (-l) r+ M[X r ,X s ],X 0 ,...,Xr,^Xs,...,X k ) 

1 <r<s<k 
k 

= Y / (^) l X l (co(X 0 ,...,X l ,...,X k )) 

i= 0 

+ (-l) r+s uj(Xx r X s -Xx r X s ,X 0 ,...,Xr,.--,Xs,---,X k ) 

1 <r<s<k 
k 

^2(-iyXi(u(x 0 ,...,x. l ,...,x k )) 

1=0 

+ (-l) r+1 u(X 0 ,...,Xr,-,Xx r X s ,...,X k ) 

1 <r<s<k 

- Y, (- 1 )MXo,..,Xx s X r ,.,.,Xs,..,X k ) 

1 <r<s<k 
k 

= ( by using 13 - 6 ) 

2=0 
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Formalisms for Calculation 


14.1 Tensor Calculus 

When working with tensors there is no need to feel obliged to use holonomic 
frames (those coming from coordinate systems: dx l ). There are always a 

great variety of frame fields sometimes defined on sets which are larger than 
possible is possible for coordinate frames. Of course within a coordinate chart 
any other frame field (ei,...,e„) and dual frame (e,...,e n ) may be written in 
terms of the coordinate frames as e* = a 1 and e l = bidx 1 . 

From the point of view of arbitrary frame the tensor calculus is much that 
same as it is for coordinate frames. One writes any tensor as 

T = T n ~ ik e h <g> • • • <g> e ik <8 e jl ® • • • <8» e jl 

= Tj e/ 0 e J . 

If two frame fields are related on the overlap of their domains by f t = e : jgj and 
f = C 9 _1 )je j then with := g l r \;; ik k := g l r \g!? 2 ■ ■ ■ g^ and (jT 1 )^ := (s _1 )£.V.r* 

’ (5 -1 )?, we have 


T'J = T!?e H (£e s 

Tj e R g f ® (< g~ l ) J s e s = Tg e R 0 e s 
(T'j 1 gfig-^en ®e s = T£e R ® e s 

and so the transformation law for the tensor components is Tg = gfT'J (g~ l ) J s 
or 

(g~ 1 ) I R T s(g)j = T'j 

( Bv ) = td i — ► b l j id 
( Bv,w ) = (b l j v°)g is w s 
(' v,B t w ) = v % gik{b(t) k jW 3 ) 

(&® j v j )g is w s = gjkvi (b(t)gW s ) 
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b l j 9is = g 3 kb{t) k s 
g ek b' j g is = g' 1 g i i. b( I ) k 
g ej U j g is = Slb(t) k 
bi j = b(t)l 

Vi i-> bi j v j 

14.2 Covariant Exterior Calculus, Bundle- Valued 
Forms 

A moving frame of an open subset U of an n-dimensional C°°— manifold is an 
n-tuple of vector fields X\, ...,X n £ X(U) such that X± (p), ...,X n (p) is a basis 
for T p M for every p £ M. It is easy to see that there is a dual moving frame 
which is an n— tuple of 1— forms 0 1 , 9 n £ X*(U) so that 

<nxj) = 6}. 

Now we can clearly find an open cover {U a } of M with set up like above for 
each: 


U a 

X al ,...,x 

an G X(U a ) 

e 1 a ,....,ei£X*(u a ) 


On possibility is to choose an atlas {U a ,ip a } for M and let X al , ...,X an be the 
coordinate vector fields {^fr, gfs:} so that also {6d , 0”} = {dx l a , dx ”} 
but this is not the only choice and that is where the real power comes from since 
we can often choose the frames in a way that fits the geometric situation better 
than coordinate fields could. For example, on a Riemannian manifold we might 
choose the frames to be orthonormal at each point. This is something that we 
cannot expect to happen with coordinate vector fields unless the curvature is 
zero. 

Returning to a general C' oc -manifold. let us consider the TM-valued 
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Topology 


When science finally locates the center of the universe, some people 
will be surprised to learn they’re not it. 

-Anonymous 


15.1 Attaching Spaces and Quotient Topology 


Suppose that we have a topological space X and a surjective set map / : X — » S 
onto some set S. We may endow S with a natural topology according to the 
following recipe. A subset U C S is defined to be open if and only if f~ 1 {U) is 
an open subset of X. This is particularly useful when we have some equivalence 
relation on X which allows us to consider the set of equivalence classes X/ ~. 
In this case we have the canonical map g : X — > X/ ~ which takes x £ X 
to its equivalence class [x\. The quotient topology is then given as before by 
the requirement that U C S is open iff and only if p _1 ([/) is open in X. A 
common application of this idea is the identification of a subspace to a point. 
Here we have some subspace A C X and the equivalence relation is given by 
the following two requirements: 

If x £ X\A then x ~ y only if x = y 
If x £ A then x ~ y for any y € A 


In other words, every element of A is identified with every other element of A. 
We often denote this space by X/A. 
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XJA 


Figure 15.1: creation of a “hole” 




[A] 

X X/A 


A hole is removed by identification 


It is not difficult to verify that if X is Hausdorff (resp. normal) and A is closed 
then X/A is Hausdorff (resp. normal). The identification of a subset to a point 
need not simplify the topology but may also complicate the topology as shown 
in the figure. 

An important example of this construction is the suspension. If X is a 
topological space then we define its suspension SX to be (X x [0, 1 })/A where 
A := (X x {0}) U (X x {1}). For example it is easy to see that SS 1 = S 2 . More 
generally, SS n ~ x = S n . 

Consider two topological spaces X and Y and subset A C X a closed subset. 
Suppose that we have a map a : A — > B C Y. Using this map we may define 
an equivalence relation on the disjoint union A"[_|U which is given by requiring 
that x ~ a(x) for x £ A. The resulting topological space is denoted X U a Y. 
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XX[0,1] sx 


Y 
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Figure 15.2: Mapping Cylinder 


Attaching a 2-cell 


Another useful construction is that of a the mapping cylinder of a map / : X — » 
Y. First we transfer the map to a map on the base X x {0} of the cylinder 
X x I by 


/O,0) := f(x) 


and then we form the quotient Y Uf (X x I). We denote this quotient by Mf 
and call it the mapping cylinder of /. 
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15.2 Topological Sum 

15.3 Homotopy 



Homotopy as a family of maps. 


Definition 15.1 Let /o,/i '■ X —> Y be maps. A homotopy from fo to f\ is 
a one parameter family of maps {ht : X — > Y : 0 < f < 1} such that ho = fo 
, hi = fi and such that (x,t) i— > ht( x) defines a (jointly continuous) map 
X x [0, 1] — > Y. If there exists such a homotopy we write fo — f\ and say that 
fo is homotopic to f\. If there is a subspace A C X such that h t \A= fo\A for 
all t £ [0, 1] then we say that fo is homotopic to f\ relative to A and we write 
fo - fi(rel A). 

It is easy to see that homotopy equivalence is in fact an equivalence relation. 
The set of homotopy equivalence classes of maps X — > Y is denoted [X, Y] or 
*(X,Y). 

Definition 15.2 Let fo, fi ■ (X, A) — > (Y ,B) be maps of topological pairs. A 
homotopy from fo to f\ is a homotopy h of the underlying maps fo, fi : X — > Y 
such that h t (A) C B for all t £ [0, 1]. If S C X then we say that fo is homotopic 
to f\ relative to S if ht \ S = fo\ S for all t £ [0, 1]. 

The set of homotopy equivalence classes of maps ( X , A) — > ( Y , B) is denoted 
[(X, A), (Y, B)\ or 7 r((X, A ), (Y, B)). As a special case we have the notion of a 
homotopy of pointed maps fo, fi ■ ( X , xo) — > (Y, y Q ). The points £o and yo are 
called the base points and are commonly denoted by the generic symbol *. The 
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Figure 15.3: Retraction onto “eyeglasses” 


set of all homotopy classes of pointed maps between pointed topological spaced 
is denoted [(X, xo), ( Y , y 0 )] or 7 r((X, xo), (Y, y 0 )) but if the base points are fixed 
and understood then we denote the space of pointed homotopy classes as [X, Y] 0 
or 7r(X, y)o- We may also wish to consider morphisms of pointed pairs such 
as / : (X, A , ao) — >■ ( Y , B, b 0 ) which is given by a map / : (X, A) — > (Y , B) such 
that /(ao) = ho- Here usually have ao £ A and b 0 € B. A homotopy between 
two such morphisms, say fo and f\ : (X, A, ao) — > ( Y , B , b 0 ) is a homotopy h of 
the underlying maps (X, A) —+ (Y ,B) such that h t {ao) = b 0 for all t £ [0,1]. 
Clearly there are many variations on this theme of restricted homotopy. 

Remark 15.1 Notice that if fo,fi '■ (X, A) — > ( Y ,yo) are homotopic as maps 
of topological pairs then we automatically have fo ~ fi(rel A). However, this is 
not necessarily the case if {yo} is replaced by a set B C Y with more than one 
element. 

Definition 15.3 A (strong) deformation retraction of X onto subspace A C 
X is a homotopy ft from fo = idx to f\ such that /i(X) C A and ft \ A = idyt 
for all t £ [0,1]. If such a retraction exists then we say that A is a (strong) 

deformation retract of X. 


Example 15.1 Let f t : R"\{0} — » R"\{0} be defined by 

ft( x ) ■= t j^| + (1 - t)x 

for 0 < t < 1. Then f t gives a deformation retraction o/R n \{0} onto /S'” -1 C 

M n . 
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Figure 15.4: Retraction of punctured plane onto S' 1 


Definition 15.4 A map f : X — > Y is called a homotopy equivalence if 

there is a map g :Y — > X such that f o g ~ idy and jo/~ idx ■ The maps are 
then said to be homotopy inverses of each other. In this case we say that X and 
Y are homotopy equivalent and are said to be of the same homotopy type. 
We denote this relationship by X ~ F 

Definition 15.5 A space X is called contractible if it is homotopy equivalent 
to a one point space. 

Definition 15.6 A map f : X — > Y is called null-homotopic if it is homo- 
topic to a constant map. 

Equivalently, one can show that X is contractible iff every map f : X —> Y 
is null-homotopic. 


15.4 Cell Complexes 

Let I denote the closed unit interval and let I n := I x • • • x I be the n-fold 
Cartesian product of / with itself. The boundary of I is dl = {0, 1} and 
the boundary of / 2 is dl 2 = (/ x {0,1}) U ({0,1} x I). More generally, the 
boundary of I n is the union of the sets of the form I x • • • x dl ■ ■ ■ x I. Also, 
recall that the closed unit n-disk D n is the subset of R" given by {|x| 2 < 1} 
and has as boundary the sphere S'” -1 . From the topological point of view the 
pair (/”,£>/”) is indistinguishable from the pair (D",S” _1 ). In other words , 
is homeomorphic to (D”,S" _1 ). 
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There is a generic notation for any homeomorphic copy of I n = D n which 
is simply e". Any such homeomorph of D n is referred to as a closed n-cell. If 
we wish to distinguish several copies of such a space we might add an index to 
the notation as in e”,e 2 ...etc. The interior of e" is called an open n-cell and is 
generically denoted by e". The boundary is denoted by 9e" (or just de n ). Thus 
we always have (e n ,de n ) = (P>”, S' n_1 ). 

An important use of the attaching idea is the construction of so called cell 
complexes . The open unit ball in K™ or any space homeomorphic to it is 
referred to as an open n-cell and is denoted by e n . The closed ball is called a 
closed n-cell and has as boundary the n — 1 sphere. A 0-cell is just a point and 
a 1-cell is a (homeomorph of) the unit interval the boundary of which is a pair 
of points. We now describe a process by which one can construct a large and 
interesting class of topological spaces called cell complexes. The steps are as 
follows: 

1. Start with any discrete set of points and regard these as 0-cells. 

2. Assume that one has completed the n — 1 step in the construction with a 
resulting space A” -1 , construct X n by attaching some number of copies 
of n-cells {e((} a6 A (indexed by some set A) by attaching maps f a : de ” = 

Gjn—1 ^ X n ~ 1 

3. Stop the process with a resulting space X n called a finite cell complex 
or continue indefinitely according to some recipe and let X = (J n>0 
and define a topology on X as follows: A set U C X is defined to be open 
iff U D X n is open in X n (with the relative topology). The space X is 
called a CW-complex or just a cell complex . 

Definition 15.7 Given a cell complex constructed as above the set X n con- 
structed at the n-th step is called the n-skeleton. If the cell complex is finite 
then the highest step n reached in the construction is the whole space and the 
cell complex is said to have dimension n. In other words, a finite cell complex 
has dimension n if it is equal to its own n-skeleton. 

It is important to realize that the stratification of the resulting topological 
space by the via the skeletons and also the open cells that are homeomorphically 
embedded are part of the definition of a cell complex and so two different cell 
complexes may in fact be homeomorphic without being the same cell complex. 
For example, one may realize the circle S 1 by attaching a 1-cell to a 0-cell or 
by attaching two 1-cells to two different 0-cells as in figure 15.5. 

Another important example is the projective space P ra (R) which can be 
thought of as a the hemisphere 5? = {x £ R” +1 : x n+1 > 0} which antipodal 
points of the boundary dSf = S'” -1 identified. But S'" -1 with antipodal point 
identified is just P" _1 (R) and so we can obtain P ra (R) by attaching an ?r-cell 
e n to P” -1 (R) via the attaching map de n = S n ~ l — + P n ~\M.) which is just the 
quotient map of S’" -1 onto P" -1 (R). By repeating this analysis inductively we 






Figure 15.5: Two different cell structures for S 1 . 


conclude that P”(R) can be obtained from a point by attaching one cell from 
each dimension up to n : 


P"(R) = e° U e 2 U • • • U e" 


and so P" (K) is a finite cell complex of dimension n. 
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Chapter 16 

Algebraic Topology 

16.1 Axioms for a Homology Theory 


Consider the category TV of all topological pairs ( X , A) where X is a topological 
space, A is a subspace of X and where a morphism / : (X, A) — > (X',A') is 
given by a map / : X — > X' such that /(A) C A. We may consider the 
category of topological spaces and maps as a subcategory oiTV by identifying 
(X, 0) with X. We will be interested in functors from some subcategory MTV 
to the category Z— QAQ of Z-graded abelian groups. The subcategory MTV 
(tentatively called “nice topological pairs”) will vary depending of the situation 
but one example for which things work out nicely is the category of finite cell 
complex pairs. Let A/. and y) B ^ be graded abelian groups. A morphism 
of Z-graded abelian groups is a sequence {hk} of group homomorphisms hk : 
Ak — > Bk- Such a morphism may also be thought of as combined to give a 
degree preserving map on the graded group; h : Ak — ■> Bk ■ 

A homology theory H with coefficient group G is a covariant functor he from 
a category of nice topological pairs MTV to the category Z —QAQ of Z-graded 
abelian groups: 


h .( (X,A)» H(X,A,G) = j: pGZ H p (X,A,G) 

G ' 1 f ^ f* 

and which satisfies the following axioms (where we write H p (X, 0) = H p (X ) 
etc.): 

1. H p (X, A) = 0 for p < 0. 

2. (Dimension axiom) H p (pt) = 0 for all p > 1 and Ho(pt) = G. 

3. If / : (X, A) — > (X',A') is homotopic to g : (X, A) — > (. X',A ') then 
/* = g* 
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4. (Boundary map axiom) To each pair ( X , A) and each p £ Z there is a 
boundary homomorphism d p : H p (X,A\G ) — > H p _i{A\G) such that for 
all maps / : (X, A) — > {X' , A') the following diagram commutes: 

H p (X, A- G) h H P (X', A'\ G) 

d P [ d p | 

H p .i(A-G) -> H p _i{A'\ G) 

U\A). 

5. (Excision axiom) For each inclusion i : ( B,B fl A) — > [A U B,A) the 
induced map : H(B, B fl A]G) — > H(A\J B,A-, G) is an isomorphism. 

6. For each pair ( X , A) and inclusions i : A X and j : (X, 0) (X, A) 

there is a long exact sequence 


> H P+1 {A) h H P+1 (X) h H p+ i(X, A) 

d P + i i/ 

H p+ i(A)h H p+1 (X)h 

where we have suppressed the reference to G for brevity. 


16.2 Simplicial Homology 

Simplicial homology is a perhaps the easiest to understand in principle. 
And we have 


16.3 Singular Homology 

The most often studied homology theory these days is singular homology. 

16.4 Cellular Homology 

16.5 Universal Coefficient theorem 

16.6 Axioms for a Cohomology Theory 

16.7 DeRham Cohomology 

16.8 Topology of Vector Bundles 

In this section we study vector bundles with finite rank. Thus, the typical fiber 
may be taken to be K” (or C" for a complex vector bundle) for some positive 
integer n. We would also like to study vectors bundles over spaces that are not 
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Figure 16.1: Simplicial complex 



Figure 16.2: Singular 2-simplex 
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necessarily differentiable manifolds; although this will be our main interest. All 
the spaces in this section will be assumed to be paracompact Hausdorff spaces. 
We shall refer to continuous maps simply as maps. In many cases the theorems 
will makes sense in the differentiable category and in this case one reads map 
as “smooth map”. 

Recall that a (rank n) real vector bundle is a triple (tte, E, M) where E 
and M are paracompact spaces and tte ■ E — > M is a surjective map such that 
there is a cover of M by open sets U a together with corresponding trivializing 
maps (VB-charts) (j> a : (U a ) — > U a x R" of the form (j> a = (7Te,<I> q ). Here 

<I> Q : (U a ) — » R" has the property that : E x — > K n is a diffeomorphism 

for each fiber E x := (x). Furthermore, in order that we may consistently 

transfer the linear structure of R” over to E x we must require that when U a fl 
Up ^ 0 and x € U a fl Up then function 

= *p\ Ek ° ^\eI : R" ^ K" 

is a linear isomorphism. Thus the fibers are vectors spaces isomorphic to R". 
For each nonempty overlap U a fl Up we have a map U a fl Up — > GL(n ) 

x l-> < h / 3a;x- 

We have already seen several examples of vector bundles but let us add one 
more to the list: 

Example 16. 1 The normal bundle to S n C R” +1 is the subset N(S n ) of S n x 
R ra+1 given by 

N(S n ) := {(x,v) : x ■ v = 0}. 

The bundle projection ir E(s n ) given by (x,v) i— > x. We may define bundle 
charts by taking opens sets U a C S n which cover S n and then since any (x, v ) € 
7 T N(S n )(Ua) is of the form (x,tx) for some t € R we may define 

4> a : (x,v) = ( x,tx ) i— > (x,t). 

Now there is a very important point to be made from the last example. 
Namely, it seems we could have taken any cover {U a } with which to build the 
VB-charts. But can we just take the cover consisting of the single open set 
U\ := S n and thus get a VB-chart N(S n ) — > S n x R? The answer is that in 
this case we can. This is because N(S n ) is itself a trivial bundle; that is, it 
is isomorphic to the product bundle S n x R. This is not the case for vector 
bundles in general. In particular, we will later be able to show that the tangent 
bundle of an even dimensional sphere is always nontrivial. Of course, we have 
already seen that the Mobius line bundle is nontrivial. 


16.9 deRham Cohomology 

In this section we assume that all manifolds are finite dimensional, Hausdorff, 
second-countable and C°° . We will define the de Rham cohomology of a smooth 
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manifold which will, of course, be a topological invariant. However, the defini- 
tion involves the calculus of differential forms and hence uses the differentiable 
structure of the manifold. 

Definition 16.1 A differential form a £ f l k (M) is called closed if da = 0 and 
is called exact if there exists a form (3 such that a = d/3. 

It is east to check that a linear combination of closed forms is closed and 
that every exact form is closed. Thus if Z k (M) denotes the set of all closed 
forms and B k (M) the set of all exact forms then 

B k (M) = img (d : -> Cl k (M)), 

Z k (M) = ker (d : f l k (M) -> f l k+1 (M)) 

and are real vector spaces and B k (M) C Z k (M). Like all vector spaces, these 
spaces are, a fortiori, also abelian groups. 

Definition 16.2 The quotient (vector) space H k (M) := Z k (M)/B k (M) is 
called the k-th de Rham cohomology group of M. 

We will start be computing two simple cases. First, let M = {p}. That is, 
M consists of a single point and is hence a O-dimensional manifold. In this case, 

( R for k = 0 

n k ({ P }) = z k ({ P }) = \ 

I 0 for k > 0 


Furthermore, B k (M ) = 0 and so 




R for k = 0 
0 for k > 0 


Next we consider the case M = R. Here, Z°(R) is clearly just the constant 
functions and so is (isomorphic to) R. On the other hand, B°( R) = 0 and so 

H°( R) = R. 

Now since d : H 1 (R) — » H 2 (R) = 0 we see that Z : (R) = H 1 (R). If g(x)dx £ 
H 1 (R) then letting 

f(x) '■= / g(x)dx 
Jo 

we get df = g(x)dx. Thus, every H 1 (R) is exact; B 1 (R) = fi 1 (R). We are led to 

H k ( R) =0. 

From this modest beginning we will be able to compute the de Rham cohomology 
for a large class of manifolds. Our first goal is to compute H k (M.) for all k. In 
order to accomplish this we will need a good bit of preparation. The methods are 
largely algebraic and so will need to introduce a small portion of “homological 
algebra” . 
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Definition 16.3 Let R be a commutative ring. A differential R— complex 
is a direct sum of modules C = ® fcgZ C k together with a linear map d : C — > C 
such that d o d = 0 and such that d(C k ) c C k+1 . Thus we have a sequence of 
linear maps 


■C 


k— 1 (jk 


£tk-\-l 


where we have denoted the restrictions d\ C k all simply by the single letter d. 


Let A = ® fcgZ A fc and B = ® fcgZ B k be differential complexes. A map 
/ : A — > B is called a chain map if / is a (degree 0) graded map such that 
d o f = / o g. In other words, if we let f\ A k := f k then we require that 
fk(A k ) C B k and that the following diagram commutes for all k: 


A k ~^ — > A k — > A k +^ — > 

fk— 1 i fk i fk -\- 1 1 

J^k — 1 Qk~\~ 1 ^ 

Notice that if / : A — > B is a chain map then ker (/) and img(/) are complexes 
with ker(/) = ® fc6Z ker(/ fe ) and img(/) = ® fcgZ img(/ fc ). Thus the notion of 
exact sequence of chain maps may be defined in the obvious way. 

Definition 16.4 The k-th cohomology of the complex C = ® fc6Z C k is 

rrk (r ^ , = kcr(d| C' k ) 
img( d\ C k ~ l ) 

The elements of ker(d| C k ) (also denoted Z k (C)) are called cocycles while the 
elements of img(d\C k ~ 1 ) (also denoted B k (C)) are called coboundaries. 

We already have an example since by letting Ll k (M) := 0 for k < 0 we have 
a differential complex d : f l(M) — > O (M) where d is the exterior derivative. In 
this case, H k (0(M)) = H k (M) by definition. The reader may have noticed that 
O (M) is a A differential A— complex as well as a differential C°°(M)— complex. 
In fact, O(M) is a algebra under the exterior product (recall that A : f 1 k (M) x 
Ll l (M) — > Ll l+k (M)). This algebra structure actually remains active at the level 
of cohomology: If a € Z k (M) and [3 € Z l (M) then for any a',/3' € fl k ~ 1 (AI) 
and any ff € O i_1 (A/) we have 

( a + da') A (3 = a A (3 + da' A (3 

= a A (3 + d{a A (3) — (— l)^ 1 ^ A d(3 
= a A (3 + d(a' A (3) 

and similarly a A {(3 + d/3') = a A (3 + d(a A (3'). Thus we may define a product 
H k (M) x H\M) H k +\M) by [a] A [/?] := [a A 0\. 

If / : A — > B is a chain map then it is easy to see that there is a natural 
(degree 0) graded map /* : H — > H defined by 

/*( M) := [f(x)] for x G C k . 
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Definition 16.5 An exact sequence of chain maps of the form 
is called a short exact sequence. 

Associated to every short exact sequence of chain maps there is a long exact 
sequence of cohomology groups: 


/\ 

H k (A) -A H k \B) -C H k (C) 

5 \ 

H k (A) H k (B) H k (C) 

\S 

The maps /* and g* are the maps induced by / and g where the “connector 
map” 5 : H k (C) — > H k (A) is defined as follows: Referring to the diagram below, 
let c € Z k (C) C C k so that dc = 0. 


0 — 

-> A k+1 

f ; B k+l 

9 ) £7&+i 

— > 0 


d T 

d T 

d T 


0 — 

■» A k 

B k 

— — c k - 

— ► 0 


By the surjectivity of g there is an b £ B k with g(b) = c. Also, since g(db) = 
d(g(b)) = dc= 0, it must be that db = f(a) for some a £ A k+1 . The scheme of 
the process is 

c ---» b — -> a. 

Certainly f{da) = d(f(a)) = ddb = 0 and so since / is 1-1 we must have da = 0 
which means that a £ Z k+1 (C). We would like to define <5 ( [c] ) to be [a] but we 
must show that this is well defined. Suppose that we repeat this process starting 
with d = c + dck- i for some Ck-i £ C fc_1 . In our first step we find b' £ B k 
with g(b') = d and then a! with f(a') = db' . We wish to show that [a] = [a']. 
We have g(b — b') = c — c = 0 and so there is an £ A k with /(a*,) = b — b' . 
By commutativity we have 

f(d(a k )) = d(f(a k )) = d(b-b') 

= db — db' = /(a) - f(a') = f(a - a') 

and then since / is 1-1 we have d{a k ) = a — a' which means that [a] = [a'\. 
We leave it to the reader to check (if there is any doubt) that S so defined is 
linear. 

We now return to the de Rham cohomology. If / : M — > N is a C°° map 
then we have /* : f 2 (IV) — > f l(M). Since pull back commutes with exterior 
differentiation and preserves the degree of differential forms, /* is a chain map. 
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Thus we have the induced map on the cohomology which we will also denote by 
/* : 


/* : H*(M) -> H*(M) 

r ■ M - [ra] 

where we have used H*(M) to denote the direct sum Notice that 

/ i— > f* together with M i— > H*(M) is a contravariant functor since if / : M — > 
N and g : N — » P then 

(go /)* = /* og*. 

In particular if ijj : U — * M is inclusion of an open set U then l\jOl is the same 
as restriction of the form a to U. If [a] € H*(M) then f*([a]) G H*{U)\ 

/* : H*(M) -> H*(U). 


16.10 The Meyer Vietoris Sequence 

Suppose that M = U 0 U Ui for open sets U. Let U 0 U U\ denote the disjoint 
union of U and V. We then have inclusions i\ : U-\ —> M and t 2 : U 2 —> M as 
well as the inclusions 


a 0 :f/ o rWi^[/i^[/oU Ui 


and 


di : U 0 n Ui -> U 0 ^ U 0 U Ui 
which we indicate (following [Bott and Tu]) by writing 


^0 do 

Mt= u 0 uUit=u 0 n Ui 

1 1 di 


This gives rise to an exact sequence 

0 f i(M) 4 o(u 0 ) © n(Ui) 4 o(u 0 nt/i)-> o 

where t(u) := (tgW, i\u>) and d*(a,(3) := (d J(/3) — <9*(a)). Notice that CqU> G 
O(f7 0 ) while l*w G Q(Ui). Also, 0^(0) = 0\ UonUl and d*(a) = a\ UonUl and live 
in O(U 0 nUi). 

Let us show that this sequence is exact. First if i(u>) := (l\w, tjw) = (0, 0) 
then u>\jj = oj\u 1 =0 and so to = 0 on M = t/oUC/i thus l* is 1-1 and exactness 
at f l(M) is demonstrated. 

Next, if 77 G 0(Uq fl U\) then we take a smooth partition of unity {po,pi} 
subordinate to the cover {f7 0 , Ui} and then let u := (— (pir ]) u ° , (pov) Ul ) where 
we have extended pi\ v nc/i 77 by zero to a smooth function (piif) u ° on U$ and 



16.11. SHEAF COHOMOLOGY 


253 



Po\u 0 nUi 7 7 a function (poif) Ul on U\ (think about this). Now we have 

9*{-(piv) U °,(poV) Ul ) 

= ((PoV) Ul \ UonUl + (piV) Uo \ UonUl ) 

= Po^luonih + POl\u 0 nih 
= {Po + Pi)v = V- 

Perhaps the notation is too pedantic. If we let the restrictions and extensions 
by zero take care of themselves, so to speak, then the idea is expressed by saying 
that d* maps (— pip, pop) G O(U 0 ) ® to p 0 p - (— pi??) = ?? G O(U 0 fl Ui). 

Thus we see that d* is surjective. 

It is easy to see that d* o t* = 0 so that img(3*) C ker(i*). Finally, let 
(a, P) G O(t/o) ® fi(17i) and suppose that d*(a,P) = (0,0). This translates to 
a \u 0 nu i = P\u 0 nU! which means that there is a form to G H([/ 0 U U\) = f l(M) 
such that lo coincides with a on Uq and with (3 on Uq. Thus 

i OJ = {LqW, tiid) 

= (at, (3) 

so that ker((*) C img(<9*) which together with the reverse inclusion gives img(9*) 
ker(<*). 


16.11 Sheaf Cohomology 

Under Construction 


16.12 Characteristic Classes 
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Chapter 17 

Lie Groups and Lie 
Algebras 

17.1 Lie Algebras 

Let F denote on of the fields R. or C. In definition 7.8 we defined a real Lie 
algebra g as a real algebra with a skew symmetric (bilinear) product (the Lie 
bracket), usually denoted with a bracket v,w i— > [v, w], such that the Jacobi 
identity holds 

[x, [y, 2 ]] + [y, [z, x]] + [z, [ x , y]\ = 0 for all x,y,z € 0 . (Jacobi Identity) 

We also have the notion of a complex Lie algebra define analogously. 

Remark 17.1 We will assume that all the Lie algebras we study are finite 
dimensional unless otherwise indicated. 

Let V be a finite dimensional vector space and recall that g[(V,F) is the set 
of all F— linear maps V — » V. The space g[(V,F) is also denoted HoniF(V, V) or 
L] fr(V, V) although in the context of Lie algebras we take gI(V,F) as the preferred 
notation. We give g[(V,F) its natural Lie algebra structure where the bracket 
is just the commutator bracket 


[A, B\ := A o B - B o A. 


If the field involved is either irrelevant or known from context we will just write 
g[(V). Also, we often identify gt(F n ) with the matrix Lie algebra M na;rl (F) with 
the bracket AB — BA. 

For a Lie algebra g we can associate to every basis Vi,...,v n for g the so 
called structure constants c() defined by 

fai.wj] =^2 c ij v k- 

k 
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It then follows from the skew symmetry of the Lie bracket and the Jacobi identity 
it follow that the structure constants satisfy 


r k _„k 

ij ji 


li) E k C rs C kt + C st C kr + C tr C ka = 0 


(17.1) 


Given a real Lie algebra g we can extend to a complex Lie algebra gc by 
defining as gc the complexification gc := gGujC and then extending the bracket 
by requiring 

[u (g 1, w (g 1 ] = [u, «;] (g 1. 

Then g can be identified with its image under the embedding map v <— >■ v 0 
1 . In practice one often omits the symbol (g and with the aforementioned 
identification of g as a subspace of gc the complexification just amounts to 
allowing complex coefficients. 


Notation 17.1 Given two subsets Si and £2 of a Lie algebra g we let [ Si, £ 2 ] 
denote the linear span of the set defined by {[2,2/] : x £ S 1 and y £ £2}. Also, 
let Si + £2 denote the vector space of all x + y : x £ Si and y £ £2. 


It is easy to verify that the following relations hold: 

1. [£i + £ 2) £] C [£r, £] + [£ 2 , £] 

2. [£i,£ 2 ] = [£ 2 ,£i] 

3. [£, [£ 1 , £ 2 ]] C [[£, £r], £ 2 ] -H [£ 1 , [£, £ 2 ]] 

where £1, £2 and £ are subsets of a Lie algebra g. 

Definition 17.1 A vector subspace a C g is called a subalgebra if [a, a] C g 
and an ideal if [g, a] C g. 

If a is a subalgebra of g and iq, ....,Vk,Vk+ 1, ■■■,v n is a basis for g such that 
Vi , ...., Vfc is a basis for a then with respect to this basis the structure constants 
are such that 

4 = 0 for i,j <k and s > k. 

If a is also an ideal then we must have 


cfj = 0 for i < k and s > k with any j. 

Remark 17.2 Notice that in general c‘f may be viewed as the components of a 
an element (a tensor) o/T 1 1 1 (g). 

Example 17.1 Let su( 2 ) denote the set of all traceless and Hermitian 2 x 2 
complex matrices. This is a Lie algebra under the commutator bracket (AB — 
BA). A commonly used basis for su( 2 ) is ei,e2,e3 where 


1 

' 0 -i ' 

1 

' 0 -1 ' 

1 

' -i 0 ' 

ei= 2 

i 

■ 62= 2 

1 

’ 62= 2 

0 i 
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The commutation relations satisfied by these matrices are 

[6i,6j] — Cijk&k 

where eijk is the totally antisymmetric symbol. Thus in this case the struc- 
ture constants are qt = e.^k- In physics it is common to use the Pauli matri- 
ces defined by ct; := 2i e* in terms of which the commutation relations become 
2ie^'/ c 6fc. 

Example 17.2 The Weyl basis for gl(n, R) is given by the n 2 matrices e sr 
defined by 

( C r .4 ) ij • — SriS s j. 

Notice that we are now in a situation where “double indices'’ will be convenient. 
For instance, the commutation relations read 

\eij , eki\ djkC'ii Sji Ckj 

while the structure constants are 

17.2 Classical complex Lie algebras 

If 0 is a real Lie algebra we have seen that the complexification 0c is naturally 
a complex Lie algebra. Is convenient to omit the tensor symbol and use the 
following convention: Every element of 0c may be written at v + iw for v,w £ 0 
and then 


[vi + iu>i , V2 + IW 2 ] 

= [vi,v 2 \ - [101,102] +i([oi,io 2 ] + [w 1 ,v 2 ])- 

We shall now define a series of complex Lie algebras sometimes denoted by 
A n , B n ,C n and D. n for every integer n > 0. First of all, notice that the com- 
plexification 0[(n,R) c of 0[(n,R) is really just 0l(n,C); the set of complex nxn 
matrices under the commutator bracket. 

The algebra A n The set of all traceless nxn matrices is denoted A n _ 1 and 
also by sl(n, C). 

We call the readers attention to the follow general fact: If &(., .) is a bilinear 
form on a complex vector space V then the set of all A £ 0[(n, C) such that 
b(Az, w) + b(z, Aw) = 0 is a Lie subalgebra of 0[(n, C). This follows from 

b([A, B]z , w) = b(ABz, w) — b(BAz, w) 

= —b(Bz, Aw) + b(Az, Bw) 

= b(z, BAw) - b(z, ABw) 

= b{z , [A, B]w). 
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The algebras B n and D n Let m = 2n + 1 and let b(., .) be a nondegener- 
ate symmetric bilinear form on an m dimensional complex vector space 
V. Without loss we may assume V = C m and we may take b(z, w) = 
i ZiWi. We define B n to be o(m, C) where 

o(m, C) := {A £ gl(m, C) : b{Az , w) + b(z, Aw) = 0}. 

Similarly, for m = 2 n we define D n to be o (to, C). 

The algebra C n The algebra associated to a skew-symmetric nondegenerate 
bilinear form which we may take to be b(z, w) = Z{W n +i— YYi=i z n+iWi 

on C 2n we have the symplectic algebra 

C n = s p(n, C) := {A £ gl(m, C) : b(Az, w) + b{z , Aw) = 0}. 

17.2.1 Basic Definitions 

Definition 17.2 Given two Lie algebras over a field F, say a, [,] a and b, [,](,, 
an ¥ -linear map cr is called a Lie algebra homomorphism iff 

cr([v, tn] a ) = [ov,aw} b 

for all v,w £ a. A Lie algebra isomorphism is defined in the obvious way. A 
Lie algebra isomorphism g —> g is called an automorphism of g. 

It is not hard to show that the set of all automorphisms of g, denoted Aut(g), 
forms a Lie group (actually a Lie subgroup of g[(g)). 

The expected theorems hold for homomorphisms; the image img(cr) := <r(a) 
of a homomorphism a : n — > b is a subalgebra of b and the kernel ker(cr) is an 
ideal of a. 

Definition 17.3 Let f) be an ideal in g. On the quotient vector space g/b with 
quotient map n we can define a Lie bracket in the following way: For v,w £ g/b 
choose v,w £ g with tt(v) = v and n(w) — ui we define 

[h, w\ '.= 7r([u, Ui]). 

We call g/b with this bracket a quotient Lie algebra. 

Exercise 17.1 Show that, the bracket defined in the last definition is well de- 
fined. 

Given two linear subspaces a and b of g the (not necessarily direct) sum 
a + b is just the space of all elements in g of the form a + b where a £ a and 
b £ b. It is not hard to see that if a and b are ideals in g then so is a + b. 

Exercise 17.2 Show that, for a and b ideals in g we have a natural isomorphism 
(a+ b)/b = a/(afl b). 
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Definition 17.4 If g is a Lie algebra, b and c subsets of g then the centralizer 
of b in c is {i> G c :[u, b] = 0}. 

Definition 17.5 If a is a (Lie) subalgebra of g then the normalizer of a in g 
is 

n(a) := {« £ g : [v, a] C a}. 

One can check that n(a) is an ideal in g. 

There is also a Lie algebra product. Namely, if a and b are Lie algebras, 
then we can define a Lie bracket on a x b by 

[(ai, a 2 ), (&i, 62)] := ([ai, &i], [a 2 , b 2 }). 

With this bracket a x b is a Lie algebra called the Lie algebra product of 0 
and b. The subspaces a x {0} and {0} x b are ideals in a xb which are clearly 
isomorphic to a and b respectively. 

Definition 17.6 The center y(g) of a Lie algebra g is the subspace y(g) := 
{x G g : [x, y) = 0 for all y G g}. 


17.3 The Adjoint Representation 

The center y(g) of a Lie algebra g is the largest ideal y such that [g, y] C y. It is 
easy to see that y(g) is the kernel of the map v — * ad(u) where ad(u) G gl(g) is 
given by ad(u)(a:) := [v,x\. 

Definition 17.7 A derivation of a Lie algebra g is a linear map D : g -4 g 
such that 

D[v,w\ = [Dv,w] + [v,Dw] 

for all v,w G g. 

For each v G g the map ad(u) : g — > g is actually a derivation of the Lie 
algebra g. Indeed, this is exactly the content of the Jacobi identity. Further- 
more, it is not hard to check that the space of all derivations of a Lie algebra 
g is a subalgebra of gt(g). In fact, if Di and D 2 are derivations of g then so is 
the commutator D\ o D 2 — D 2 o D\ . We denote this subalgebra of derivations 
by Der(g). 

Definition 17.8 A Lie algebra representation p of g on a vector space V 
is a Lie algebra homomorphism p : g — > g[(V). 

One can construct Lie algebra representations in various way from given 
representations. For example, if pi : g — > gI(V,:) (i = 1 are Lie algebra 
representations then ®p t : g — * gl(®jVj) defined by 


(®*Pi) (x)(vi ® • • • ® v n ) = pi(x)vi ® • • • ® pi(x)v n 


(17.2) 
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for x £ g is a Lie algebra representation called the direct sum representation 
of the pi . Also, if one defines 

{®iPi){x){®iVi) := pi(x)vi (g> v 2 ® ® v k 

+ vi ® p2(x)v2 <8> ■ ■ ■ <8> Vk H 1- vi <8> v 2 <8> • • • <8> Pk{x)vk 

(and extend linearly) then p, ; is a representation Zipi : g — * gl(<S>iV i ) is Lie 
algebra representation called a tensor product representation. 

Lemma 17.1 ad : g — > g 1(g) is a Lie algebra representation on g. The image 
of ad is contained in the Lie algebra Der(g) of all derivation of the Lie algebra 

0 - 


Proof. This follows from the Jacobi identity (as indicated above) and from 
the definition of ad. ■ 

Corollary 17.1 y(g) is an ideal in g. 

The image ad(g) of ad in Der(g) is called the adjoint algebra. 

Definition 17.9 The Killing form for a Lie algebra g is the bilinear form given 
by 

K(X, Y) = Tr{ ad(A) o ad(Y)) 

Lemma 17.2 For any Lie algebra automorphism p : g — > g and any leg w 
have ad(pX) = p&AXp^ 1 

Proof, ad (pX)(Y) = [pX,Y] = [pX,pp _1 Y] = p[X,p _1 Y] = p oadXo 
P~\Y). m 

Proposition 17.1 The Killing forms satisfies identities: 

1) K([X,Y\,Z) = K([Z,X],Y) for all X,Y,Z e g 

2) K(pX, pY) = K(X,Y) for any Lie algebra automorphism p : g —*• g and 
any X,Y € g. 

Proof. For (1) we calculate 

K([X,Y],Z) = Tr(ad([X,Y]) oad(Z)) 

= Tr([adX, adY] o ad(Z)) 

= Tr (adA o adY o ad Z — adY o adX o ad Z) 

= Tr (adZ o adX o adY — adA o ad Z o adY) 

= Tr([adZ, adA] o adY) 

= Tr(ad[Z, X] o adY) = K([Z, A], Y) 


where we have used that Tr(ABC) is invariant under cyclic permutations of 
A,B,C. 
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For (2) just observe that 

K(pX, pY) = Tr{ad{pX) o ad (pY)) 
= Tr{pad{X)p~ 1 pad{Y)p~ 1 ) (lemma 17.2) 
= Tr{pad{X) o ad (Y)p~ l ) 
= Tr{ ad(X) o ad(F)) = K{X, Y). 


Now clearly, K(X,Y) is symmetric in X, Y and so there must be a basis 
{Xi}i<i< n of g for which the matrix ( k-ij ) given by 

hi :=K(Xi,Xj) 


is diagonal. 

Lemma 17.3 If a is an ideal in 0 then the Killing form of a is just the Killing 
form of 0 restricted to a x a. 

Proof. Let {Xj}i<j<„ be a basis of 0 such that {Xj}i<j< r is a basis for a. 
Now since [a, 0] C a, the structure constants ef k with respect to this basis must 
have the property that c- = 0 for i < r < k and all j. Thus for 1 < i, j < r we 
have 


I< a {X u X 3 ) = Tr{ ad(Xi)ad(X,-)) 


E 44 

i,j = 1 


V U r k 
Z . *fc J* 
i,j = 1 


= K s (X i ,X j ). 


17.4 The Universal Enveloping Algebra 

In a Lie algebra 0 the product [., .] is usually not associative. On the other hand 
if 2( is an associative algebra then we can introduce the commutator bracket on 
21 by 

[A,B] := AB — BA 

which gives 21 the structure of Lie algebra. From the other direction, if we start 
with a Lie algebra 0 then we can construct an associative algebra called the 
universal enveloping algebra of 0. This is done, for instance, by first forming 
the full tensor algebra on 0; 

T(0) = F©0©(0 © 0)© • • • ©0® fe © • • • 


and then dividing out by an appropriate ideal: 
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Definition 17.10 Associated to every Lie algebra g there is an associative al- 
gebra 17(g) called the universal enveloping algebra defined by 

u(a) ■■= T( S )/J 

where J is the ideal generated by elements in T(g) of the form X (g> Y — Y ® 
X-[X,Y]. 


There is the natural map of g into 17(g) given by the composition 7r : g 
T(g) — + T(g)/J = 17(g). For v G g, let v* denote the image of v under this 
canonical map. 

Theorem 17.1 Let V be a vector space over the field F. For every p represen- 
tation of g on V there is a corresponding representation p* of U (g) on V such 
that for all v € g we have 

p{v)=p*{v*). 

This correspondence, p i— > p* is a 1-1 correspondence. 

Proof. Given p, there is a natural representation T(p) on T(g). The repre- 
sentation T(p) vanishes on J since 

T(p)(X 0Y-y®I-[I,y]) = p(X)p(Y) - p(Y)p(X) - p([X, Y]) = 0 

and so T(p) descends to a representation p* of 17(g) on g satisfying p(v) = 
p*{v*). Conversely, if er is a representation of 17(g) on V then we put p(X) = 
<r(X*). The map p(X) is linear and a representation since 

p([X,Y}) = a([X,Y}*) 

= ff(i(I®Y-Y®I)) 

= a(X*Y* - Y*X *) 

= p(X)p(Y) - p(Y)p{X) 


for all X,Y £ g. ■ 

Now let Xi,X 2 , ..., X n be a basis for g and then form the monomials Xf X* ■ ■ ■ X 
in 17(g). The set of all such monomials for a fixed r, span a subspace of 17(g) 
which we denote by 17 r (g). Let (f ik be the structure constants for the basis 
X\,X 2 , . . . , X n . Then under the map ir the structure equations become 

[X*,X*] = Y. ck ijX* k . 

k 

By using this relation we can replace the spanning set M r = {X* X* 2 • • • X * } 
for U r {g) by spanning set A4< r for U r (g) consisting of all monomials of the 
form X* X* • • • X* where ii < *2 < • • • < i m and to < r. In fact one can then 
concatenate these spanning sets A4< r and turns out that these combine to form 
a basis for 17(g). We state the result without proof: 
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Theorem 17.2 (Birchoff-Poincare-Witt) Let Ci 1 <i 2 <...<i m = X* X'f ■ ■ ■ 

where i\ < *2 < • - • < i TO . TTie set of all such elements {ej 1 <i 2 <...<, m } for all 
to is a &asis for U(g) and the set {ej 1 <j 2 <...<j m } TO < r is a &asis /or i/ie subspace 

U r ( 0 ). 

Lie algebras and Lie algebra representations play an important role in physics 
and mathematics and as we shall see below every Lie group has an associated 
Lie algebra which, to a surprisingly large extent, determines the structure of the 
Lie group itself. Let us first explore some of the important abstract properties of 
Lie algebras. A notion that is useful for constructing Lie algebras with desired 
properties is that of the free Lie algebra f n which is defined to be the free 
algebra subject only to the relations define the notion of Lie algebra. Every Lie 
algebra can be realized as a quotient of one of these free Lie algebras. 

Definition 17.11 The descending central series of a Lie algebra 0 

is defined inductively by letting 0(i) = 0 and then 0(fe+i) = [0(fe),0]. 

From the definition of Lie algebra homomorphism we see that if a : 0 — > t) 
is a Lie algebra homomorphism then er( 0 (fc)) C l)(k)- 

Exercise 17.3 (!) Use the Jacobi identity to prove that for all positive integers 
i andj, we have [0(»), 0(i)] C 0( i+ j). 

Definition 17.12 A Lie algebra 0 is called k-step nilpotent iff Q(k+i) — 0 but 
fl(fe) 0- 

The most studied nontrivial examples are the Heisenberg algebras which are 
2-step nilpotent. These are defined as follows: 

Example 17.3 The 2n+l dimensional Heisenberg algebra f)„ is the Lie al- 
gebra (defined up to isomorphism) with a basis {X\, ...,X n ,Yi, ...,Y n , Z} subject 
to the relations 

[X j ,Y j ] = Z 

and all other brackets of elements from this basis being zero. A concrete real- 
ization of t) n is given as the set of all (n + 2) x (n + 2) matrices of the form 

0 xi ... x n z 

0 0 ... 0 Vl 


0 0 ... 0 y n 

_ 0 0 ... 0 0 

where Xi,yi,z are all real numbers. The bracket is the commutator bracket as is 
usually the case for matrices. The basis is realized in the obvious way by putting 



264 


CHAPTER 17. LIE GROUPS AND LIE ALGEBRAS 


a lone 1 in the various positions corresponding to the potentially nonzero entries. 
For example, 


0 1 ... 0 0 

0 0 ... 0 0 


and 


X 1 


0 0 
0 0 


0 0 
0 0 


0 0 ... 0 1 

0 0 ... 0 0 


0 0 ... 0 0 

0 0 ... 0 0 


Example 17.4 The space of all upper triangular nxn matrices n n which turns 
out. to be n — 1 step nilpotent. 


We also have the free k-step nilpotent Lie algebra given by the quotient 
f n ,k '■= fn/(.fn)(k) where f n is the free Lie algebra mentioned above. 

Lemma 17.4 Every finitely generated k-step nilpotent Lie algebra is isomorphic 
to a quotient of the free k-step nilpotent Lie algebra. 

Proof. Suppose that 0 is k-step nilpotent and generated by elements X \, ..., X n . 
Let F| , F n be the generators of f n and define a map h : f n — > 0 by sending 
F t to Xj and extending linearly. This map clearly factors through f n since 
h((f n )k) = 0 . Then we have a homomorphism (f n )/ c — » 0 which is clearly onto 
and so the result follows. ■ 


Definition 17.13 Let g be a Lie algebra. We define the commutator series 
{fl^} by letting 0^ = 0 and then inductively g^ = [0^ fc-1 \ 0^ fc-1 ^]. If g^ = 0 
for some positive integer k, then we call 0 a solvable Lie algebra. 

Clearly, the statement 0^ = 0 is equivalent to the statement that 0 is 
abelian. Another simple observation is that 0 (fe ) C 0(fc) so that nilpotency 
implies solvability. 

Exercise 17.4 (!) Every subalgebra and every quotient algebra of a solvable 
Lie algebra is solvable. In particular, the homomorphic image of a solvable Lie 
algebra is solvable. Conversely, if a is a solvable ideal in 0 and g/a is solvable, 
then 0 is solvable. Hint: Use that (g/a)^ = g^ /a. 

It follows from this exercise that we have 

Corollary 17.2 Let h : 0 — > 0 be a Lie algebra homomorphism. //img(/i) := 
h(g) and ker(/i) are both solvable then 0 is solvable. In particular, if img(ad) := 
ad(0) is solvable then so is g. 
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Lemma 17.5 If a is a nilpotent ideal in g contained in the center 3 (g) and if 
g/a is nilpotent then g is nilpotent. 

Proof. First, the reader can verify that (g/a)^ = g^/a. Now if g/a is 
nilpotent then Q(j)/a = o for some j and so C a and if this is the case then 
we have g( J+ i) = [g, g^)] C [g, a] = 0. (Here we have [g, a] = 0 since a C 3 (g).) 

Thus g is nilpotent. ■ 

Trivially, the center 3 (g) of a Lie algebra a solvable ideal. 

Corollary 17.3 Let h : g — > g be a Lie algebra homomorphism. If img(ad) := 
ad(g) is nilpotent then g is nilpotent. 

Proof. Just use the fact that ker(ad) = 3 (g). ■ 

Theorem 17.3 The sum of any family of solvable ideals in g is a solvable ideal. 

Furthermore, there is a unique maximal solvable ideal which is the sum of all 
solvable ideals in g. 

Sketch of proof. The proof is based on the following idea for ideals a and 
b and a maximality argument.. If a and b are solvable then afl b is an ideal in 
the solvable a and so is solvable. It is easy to see that a + b is an ideal. We 
have by exercise 17.2 (a+ b)/b = a/(afl b). Since a/(afl b) is a homomorphic 
image of a we see that a/ (a D b)=(a+ b)/b is solvable. Thus by our previous 
result a + b is solvable. ■ 

Definition 17.14 The maximal solvable ideal in g whose existence is guaran- 
teed by the last theorem is called the radical of g and is denoted rad(g) 

Definition 17.15 A Lie algebra g is called simple if it contains no ideals other 
than {0} and g. A Lie algebra g is called semisimple if it contains no abelian 
ideals (other than {0}/. 

Theorem 17.4 (Levi decomposition) Every Lie algebra is the semi-direct 
sum of its radical and a semisimple Lie algebra. 

Define semi-direct sum be- 
fore this. 

17.5 The Adjoint Representation of a Lie group 

Definition 17.16 Fix an element g £ G. The map C g : G — > G defined by 
C g (x) = gxg ~ 1 is called conjugation and the tangent map T e C g : g — > g is 
denoted Ad ff and called the adjoint map. 

Proposition 17.2 C g : G — > G is a Lie group homomorphism. 

The proof is easy. 

Proposition 17.3 The map C : g 1 — > C g is a Lie group homomorphism G — * 

Aut(G). 
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The image of the map C inside Aut(G) is a Lie subgroup called the group 
of inner automorphisms and is denoted by Int(G). 

Using 8.3 we get the following 

Corollary 17.4 Ad g : g — > g is Lie algebra homomorphism. 

Proposition 17.4 The map Ad : g i— > Acl g is a homomorphism G — * GL(g) 
which is called the adjoint representation ofG. 

Proof. We have 


Ad ( gi g 2 ) = T e C gi92 = T e {C gi o C g2 ) 

= T e C gi o T e C g2 = Ad gi o Ad 92 

which show that Ad is a group homomorphism. The smoothness follows from 
the following lemma applied to the map G : (g,x) e- > C g (x). m 

Lemma 17.6 Let f : M x N — > N be a smooth map and define the partial map 
at x £ M by f x (y) = f{x, y ). Suppose that for every x £ M the point yo is fixed 
by f x : 

fx(yo) = yo for all x. 

The the map A yo : x i— > T yo f x is a smooth map from M to GL (T yo N). 

Proof. It suffices to show that A yo composed with an arbitrary coordinate 
function from some atlas of charts on GL(T yo N) is smooth. But GL(T yo N) 
has an atlas consisting of a single chart which covers it. Namely, choose a 
basis Vi,V2, —,v n of T yo N and let v 1 ,v 2 ,...,v n the dual basis of T* o N, then 
Xj ■ A i—> v l (Avj) is a typical coordinate function. Now we compose; 

Xj°A yo {x) = v'iAy^vj) 

= V 1 {Ty 0 fx -Vj). 

Now it is enough to show that T yo f x ■ Vj is smooth in x. But this is just the 
composition the smooth maps M T M x TN = T{M x N) — > T(N) given by 

x i ^ ((a:, 0), (yo,Vj)) (^/j (x,y 0 ) ■ 0 + (d 2 f) (x,y 0 ) ■ v 3 

= Ty 0 f X ‘ Vj . 

(The reader might wish to review the discussion leading up to lemma 3.4). 


Lemma 17.7 Let v £ g. Then L v (x) = 

Proof. L v (x) = T e (L x ) ■ v = T{R x )T{R x -i)T e {L x ) ■ v = T(R X )T(R X - 1 o 
L x ) ■ v = R Ad U)v_ u 

We have already defined the group Aut(G) and the subgroup Int(G). We 
have also defined Aut(g) which has the subgroup Int(g) := Ad{G). 

We now go one step further and take the differential of Ad. 
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Definition 17.17 For a Lie group G with Lie algebra g define the adjoint 
representation of g, a map ad : g — > gl( g) by 

ad = T e Ad 


The following proposition shows that the current definition of ad agrees with 
that given previously for abstract Lie algebras: 

Proposition 17.5 ad(u)tu = [u, w] for all v,w £ g . 

Proof. Let v 1 , ..., v n be a basis for g so that Ad(x)u; = a i{ x )v l for some 
functions gj. Then we have 


ad(u)w = T e (Ad()w)u 
= d(^2ai()v l )v 
= ^2( da i\ e v)v z 
= J2(L v a t )(eW 

On the other hand, by lemma 17.7 

L w (x) = = R^afix)^) 

= y^g i(x)R v '(x) 

Then we have 

[L\L W ] = [L v ,J2a,.()R v \)\=0 + J2L v (ai)R vi . 

Finally, we have 


[w,v] = [L w , L v ](e) 
= J2L v (a l )(e)R v \e) = £ L t '(a,)(e)u i 

ad(u)w. 


The map ad : g — > gl(g) = End(T e G ) is given as the tangent map at the 
identity of Ad which is a Lie algebra homomorphism. Thus by 8.3 we have the 
following 

Proposition 17.6 ad : g — > gl(g) is a Lie algebra homomorphism. 

Proof. This follows from our study of abstract Lie algebras and proposition 

17.5. ■ 

Lets look at what this means. Recall that the Lie bracket for gl(g) is just 
A o B — B o A. Thus we have 

ad([u, io]) = [ad(u), ad(tu)] = ad(u) o ad(ie) — ad(u>) o ad(u) 
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which when applied to a third vector 2 gives 

[[■ v , w],z] = [v, [w, 2]] - [w, [u, 2]] 

which is just a version of the Jacobi identity. Also notice that using the anti- 
symmetry of the bracket we get 

[2, [u, w}\ = [w, [z, u]] + [u, [2, w]] 

which is in turn the same as 

ad(2)([v, w]) = [aA(z)v,w] + [v,ad(z)w] 

so ad(2) is a derivation of the Lie algebra g as explained before. 

Proposition 17.7 The Lie algebra Der(g) of all derivation of g is the Lie 
algebra of the group of automorphisms Aut(g). The image ad(g) C Der(g) is 
the Lie algebra of the set of all inner automorphisms Int(g). 


ad(g) C Der(g) 

I I 

Int(g) C Aut(g) 

Let pi : G x G — > G be the multiplication map. Recall that the tangent space 
T( g ,h)(G x G) is identified with T g G x T^G. Under this identification we have 

T( g ,h)H(v, w) = T h L g w + T g R h v 

where v £ T g G and w £ T^G. The following diagrams exhibit the relations: 


pr 1 
/ 

G x G, ( g,h ) 

l 

pr 2 

\ 


G,g -+ 

G x G, (5, h) 

<- 

G,h 

\ 

1 M 



Rh 

G,gh 

The horizontal maps are the insertions g 1— > {g 1 

Lg 

h ) and h 1— > 

tangent functor to the last diagram gives. 

Tpri TrgUGxG) 


Tpr 2 

/ 

I 

\ 


TgG 

TgG x T h G 

<- 

T h G 

\ 

i Tg 



TgRh 

TghG 


ThLg 


(g,h). Applying the 


In lieu of a prove we ask the reader to examine the diagrams and try to construct 
a proof on that basis. 
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We have another pair of diagrams to consider. Let v : G — > G be the 
inversion map v : g i— > g~ l ■ We have the following commutative diagrams: 


Rg- 1 


G,g 


Lg-l 


/ 


\ 


G, e 


lv 


G, e 


\ 


/ 


Lg- 1 


G^g- 1 


Rg~l 


Applying the tangent functor we get 


TRg- 1 


TgG 

TLg-, 

\ 

T e G 

\ 

1 Tv 

T e G 

/ 

TLg-l 


Tg-,G 

TR g-l 

express 

here 

is that TgV = TL g 


-L lio lOUUlU V*V *V lull HO vjvpi iiOIV J.U U11UU JL g IS J- J-/ g — X JL J- — X -1 J lsg—± ^ 

TL„-i. Again the diagrams more or less give the proof which we leave to the 


reader. 
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Chapter 18 

Group Actions and 
Homogenous Spaces 


Here we set out our conventions regarding (right and left) group actions and 
the notion of equi variance. There is plenty of room for confusion just from the 
issues of right as opposed to left if one doesn’t make a few observations and set 
down the conventions carefully from the start. We will make the usual choices 
but we will note how these usual choices lead to annoyances like the mismatch 
of homomorphisms with anti-homomorphisms in proposition 18.1 below. 


18.1 Our Choices 

1. We have define the Lie derivative by use of the contravariant functor / i— > 
/* so that L X Y = ^| (FZ^) Y . Notice that we are implicitly using a 
right action of Diff(M) on X(M) 1 . Namely, Y i— > f*Y. 

2. We have chosen to make the bracket of vector fields be defined so that 
[. X , Y] = XY — YX rather than by YX — XY . This makes it true that 
L X Y = [X, Y] so the first choice seems to influence this second choice. 

3. We have chosen to define the bracket in a Lie algebra g of a Lie group 
G to be given by using the identifying linear map 0 = T e G — > X L (M) 
where X L (M) is left invariant vector fields. What if we had used right 
invariant vector fields? Then we would have [X e , Y e \ new = \X ' , Y'] e where 
X' g — TR g ■ X e is the right invariant vector field: 

R* h X'(g) = TR^X'(gh) = TR^TR gh • X e 

= TR ^ o T(R h o R g ) ■ X e = TR g ■ X e 

= x\g) 

x At least when X is complete since otherwise FI * is only a diffeomorphism on relatively 
compact set open sets and even then only for small enough t). 
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But notice that Now on the other hand, consider the inversion map v : 
G — > G. We have v o R g -i = L g ov and also Tv = — id at T e G so 

(v*X')(g) = Tv ■ X'ig- 1 ) = Tv ■ TR g -i • 

= T{L g o v)X e = TL g Tv ■ X e 
= —TL g X e = -X(g) 

thus v*[X',Y'] = [v* X' ,v*Y'] = [— X, — Y] = \X, Y]. Now at e we have 
(v*[X',Y'])(e) =Tvo [X',Y’] o v(e) = ~[X' ,Y'] e . So we have [X,Y] e = 
-[X',Y'} e . 

So this choice is different by a sign also. 

The source of the problem may just be conventions but it is interesting to 
note that if we consider Diff(M) as an infinite dimensional Lie group then 
the vector fields of that manifold would be maps X : Diff(M) — > X(M) 

such X (ip) is a vector field in X(M) such that FI* ^ = f>. In other 
words, a field for every diffeomorphism, a “field of fields” so to speak. 
Then in order to get the usual bracket in X(M) we would have to use right 
invariant (fields of) fields (instead of the conventional left invariant choice) 
and evaluate them at the identity element of Diff(M) to get something 
in Xi<jDiff(Af) = X(M). This makes one wonder if right invariant vector 
fields would have been a better convention to start with. Indeed some 
authors do make that convention. 

18.1.1 Left actions 

Definition 18.1 A left action of a Lie group G on a manifold M is a smooth 
map A : G x M — > M such that A(gi, X(g2,m)) = X(gig2,m)) for all 31,32 G 
G. Define the partial map X g : M — > M by X g (m) = A(g,m) and then the 
requirement is that X : 3 1— > X g is a group homomorphism G — ■> Diff(M). We 
often write X (3, m) as g ■ m. 

Definition 18.2 For a left group action as above, we have for every v £ g, we 
define a vector field v x £ X(M) defined by 

v x (m) = — exp (tv) ■ m 

dt t= 0 

which is called the fundamental vector field associated with the action X. 
Notice that v x (m) = TA( e>TO ) • (u,0). 

Proposition 18.1 Given left action X : G x M A1 of a Lie group G on a 
manifold M, the map X : g 1— > X g is a group homomorphism G — » Diff(M) 
by definition. Despite this, the map X 1— > X x is a Lie algebra anti- 
homomorphism g X(M): 

[u,w] A = -[v X ,W X ] X (M) 
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which implies that the bracket for the Lie algebra Diff(M) of Diff(M) (as an 
infinite dimensional Lie group) is in fact [X, P]jiff(M) '■= 

Proposition 18.2 IfG acts on itself from the left by multiplication L : GxG —> 

G then the fundamental vector fields are the right invariant vector fields! 

18.1.2 Right actions 

Definition 18.3 A right action of a Lie group G on a manifold M is a smooth 
map p : M x G — > M such that p(p(m,g 2 ),gi) = p(m, < 72 < 7 i)) for all gi,g 2 € 

G. Define the partial map p 9 : M — > M by p 9 (m) = p(m,g) and then the 
requirement is that p : g i— > p 9 is a group anti-homomorphism G — » Diff(M). 

We often write p(m , g) as m. • g 

Definition 18.4 For a right group action as above, we have for every v £ q a 
vector field v p £ 3t(M) defined by 

v p {m) = — in • exp (tv) 

dt t ~ o 

which is called the fundamental vector field associated with the right action 

P- 

Notice that v p (m) = Tp^ m ^ ■ (0,f). 

Proposition 18.3 Given right action p : M x G — > M of a Lie group G on a 
manifold M, the map p : g i— > p 9 is a group anti-homomorphism G — »■ Diff(M) 
by definition. However, the map X i— > X x is a true Lie algebra homo- 
morphism 0 — > X(M): 

[^,W] P = [V P ,W P ] X (M) 

this disagreement again implies that the Lie algebra Diff(M) ofiDifi(M) (as an 
infinite dimensional Lie group) is in fact X(M), but with the bracket [ X , P]atff(M) : = 

Proposition 18.4 IfG acts on itself from the right by multiplication L : G x 
G —*■ G then the fundamental vector fields are the left invariant vector fields 

Xl(G)! 

Proof: Exercise. 

18.1.3 Equivariance 

Definition 18.5 Given two left actions Ai : G x M —> M and A 2 : G x S — > S 
we say that a map f : M — > N is (left) equivariant (with respect to these 
actions) if 

f(g-s)=g- f(s) 
i.e. 

/(Ai( 5 ,s)) = a 2 (g,f(s)) 
with a similar definition for right actions. 
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Notice that if A : G x M — * M is a left action then we have an associated 
right action A _1 : M x G — > M given by 

A = A (5 _1 ,p). 

Similarly, to a right action p : M x G — > M there is an associated left action 

p~ 1 {g,p) = 

and then we make the follow conventions concerning equivariance when mixing 
right with left. 

Definition 18.6 Is is often the case that we have a right action on a manifold P 
(such as a principle bundle) and a left, action on a manifold S. Then equivariance 
is defined by converting the right action to its associated left, action. Thus we 
have the requirement 

f( s ' 9~ l ) =9' f(s) 

or we might do the reverse and define equivariance by 

f(s-g) =g~ l ■ f(s) 

18.1.4 The action of Diff(M) and map-related vector fields. 

Given a diffeomorphism $ : M — > N define !>* : T(M,TM) — > T{N,TN) by 

$,i = r$oior 1 
and <f>* : T(M,TN) -► T(M,TM) by 

$T = Tr 1 oIo$ 

If M = N, this gives a right and left pair of actions of the diffeomorphism group 
Diff(M) on the space of vector fields 3t(M) = T(M,TM). 

Diff(M) x £(M) -► X(M) 

(4, AT) i ► M 

and 

jf(M) x Diff(M) -► X(M) 

(X,4) i — ^ <S>*X 

18.1.5 Lie derivative for equivariant bundles. 

Definition 18.7 An equivariant. left action for a bundle E — > M is a pair of 
actions 7 E : G x E —> E and : G x M — > M such that the diagram below 
commutes 

j E : G x E -► E 

I I 

7 : G x M — > M 
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In this case we can define an action on the sections T(E) via 

7gS = ('y E )~ 1 O S O 7g 

and then we get a Lie derivative for X G LG 


^x(s) 


d_ 

dt 


Texp tX® 

0 


18.2 Homogeneous Spaces. 

Definition 18.8 The orbit of x € M under a right action by G is denoted x-G 
or xG and the set of orbits M/G partition M into equivalence classes. For left 
actions we write G- x and G\M for the orbit space. 

Example 18.1 If H is a closed subgroup of G then H acts on G from the right 
by right multiplication.. The space of orbits G/H of this right is just the set of 

left cosets. 

Definition 18.9 A left (resp. right) action is said to be effective if g ■ p = x 
( resp. x ■ g = x) for every x € M implies that g = e and is said to be free if 
g ■ x — x ( resp. x ■ g = x) for even one x € M implies that g = e. 

Definition 18.10 A left (resp. right) action is said to be a transitive action 
if there is only one orbit in the space of orbit which. This single orbit would 
have to be the M. So in other words, given pair x, y G M, there is a g G G with 
9 ■ x = y (resp. x ■ g = y). 


Theorem 18.1 Let X : G x M — ► M be a left, action and fix Xq G M. Let 
H = H Xo be the isotropy subgroup of Xq defined by 


H = {g £ G : g ■ x 0 = x 0 }. 


Then we have a natural bisection 

G ■ x 0 = G/H 

given by g ■ Xq i— > gH. In particular, if the action is transitive then G/H = M 
and xq maps to H. 

Let us denote the projection onto cosets by ir and also write r x ° : g i — > gx q. 
Then we have the following equivalence of maps 

G = G 

7T | | r X0 

G/H = M 

In the transitive action situation from the last theorem we may as well assume 
that M = G/H and then we have the literal equality r x ° = i r. In this case the 
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left action is just lh ■ gH i— > hgH or gx i— > hx. Now H also acts on then we 
get an action r : H x G/H — > G/H given by Th : gH i— > hgH or for getting the 
coset structure Th ■ h i— > hx. Thus r is just the restriction l : G x G/H — > G/H 
to HxG/HcGxG/H and we call this map the translation map. 

G x G/H 

ii 

Gx M 
U 

H x G/H 
ii 

H x M 

For each h £ H the map Th : M — > M fixes the point a’o and so the differential 
T X0 T h maps T Xo M onto itself. Let us abbreviate 2 by writing d,Th '■= T Xo Th ■ So 
now we have a representation p : H —> Gl(T Xo M) given by h i— > dr^. This 
representation is called the linear isotropy and the group p(H) C Gl(T Xo M ) 
is called the linear isotropy subgroup. On the other hand we for each h € H 
have the action Ch '■ G — > G given by g i — > hgh~ x which hxes H and whose 
derivative is Adh : g — > 0. It is easy to see that this map descends to a map 
Adh : g/f ) —> g/f). We are going to show that there is a natural isomorphism 
T Xo M = g/f) such that for each h £ H the following diagram commutes: 

Ad h ■ g/f) ->■ g/h 

I I ( 18 - 1 ) 

dT h ■ T Xo M — *■ T Xo M 


4 G/H = M 

ii 

4 G/H = M 


One way to state the meaning of this result is to say that h i— > Adh is represen- 
tation of H on the vector space g/f) which is equivalent to the linear isotropy 
representation. The isomorphism T Xo M = g/f) is given in the following way: Let 
£ € g and consider T e 7r(£) € T Xq M. If <; € f) then T e 7r(£ + c) = T e 7r(^) +T e n(<;) = 
T £ tt(^) and so £ i— > T e 7r(^) induces a map on g/f). Now if T e 7r(£) = 0 £ T Xo M 
then as you are asked to show in exercise 18.1 below £ £ f) which in turn means 
that the induces map g/f) — > T Xo M has a trivial kernel. As usual this implies 
that the map is in fact an isomorphism since dim(g/f)) = dim(T Xo M). Let us 
now see why the diagram 18.2 commutes. Let us take a the scenic root to 
the conclusion since it allows us to see the big picture a bit better. First the 
following diagram clearly commutes: 

expf£ (4 /i(expf^)ft _1 

7T | 7T | 

(exp t£)H h(expt(/)H 


2 This notation for the tangent map is quite common especially in this type of situation 
where we have a “canonical space” such as g or in this case T Xq M. 
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which under the identification M = G/H is just 

expf£ — * /i(exp tt;)h~ l 

7T l 7T | 

(expf£);ro — »■ /i(expf£)a;o 

Applying the tangent functor (looking at the differential) we get the commuta- 
tive diagram 

£ - M h Z 

I I 

TMO ■ ^ T e n(Ad h O 

and in turn 

[£] |— *■ 

I I 

T e n(£) T e ir(Ad h £) 

This latter diagram is in fact the element by element version of 18.2. 

Exercise 18.1 Show that T e 7r(£) G T Xo M implies that £ G f). 
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Chapter 19 

Fiber Bundles and 
Connections 


Halmos, Paul R. 

Don’t just read it; fight it! Ask your own questions, look for your own 
examples, discover your own proofs. Is the hypothesis necessary? Is the 
converse true? What happens in the classical special case? What about 
the degenerate cases? Where does the proof use the hypothesis? 

Halmos, Paul R. 

I Want to be a Mathematician, Washington: MAA Spectrum, 1985. 


19.1 Definitions 

A ( C r ) fiber bundle is a quadruple ( 7 r, E, M, F) where 7r : E — > M is a 
smooth C r — submersion such that for every p £ M there is an open set U 
containing p with a C r — isomorphism <j> = (n, $) : n~ 1 (U) — > U x F. Let us 
denote the fiber at p by E p = 7 r _1 (p). It follows that for each p £ U the 
map <I ) | F : E p — ■> F is a C' r -diffeomorphism Given two such trivializations 
(7T, $„) : 7r _1 (t/ a ) — r U a x F and : 7r ~ 1 (Up) — > Up x F then there is a 

family of diffeomorphisms | p : E p — > E p where p £ U a (lUp and so for each 
a, (5 we have a map U a fl Up — > Diff(L’) defined by p 1 — > ^ a p(p) = Q a p\ p ■ These 
are called transition maps or transition functions. 

Remark 19.1 Recall that a group action p : G x F — > F is equivalently thought 
of as a representation p : G — > Diff(i r ) given by p(g)(f) = p{g , /)• We will forgo 
the separate notation p and simple write p for the action and the corresponding 
representation. 

Returning to our discussion of fiber bundles, suppose that there is a Lie 
group action p : G x F -4 F such that for each cr, (3 we have 

$ a p(p){f) = p(g a p(p),f) 
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for some smooth map g a/ 3 : U a fl Up — * G then we have presented (n, E, M, F) 
as G bundle under the representation p. We also say that the transition 
functions live in G (via p). In most but not all cases the representation p 
will be faithful, i.e. the action will be effective and so G can be considered 
as a subgroup of Diff(.F). In this case we say simply that ( n,E,M,F ) is a G 
bundle and that the transition functions live in G. It is common to call G 
the structure group but since the action in question may not be effective we 
should really refer to the structure group representation (or action) p. 

A fiber bundle is determined if we are given an open cover U = {U a } and 
maps : U a fl Up — > diff(F) such that for all a,/ 3 , 7 

$aa(p) = id for p € U a 
^ap{p) = ®pa(p) for P£U a nUp 
$ap(p) 0 ®Pj(p) 0 $ 7 a (p) = id for p G U a n Up n U 1 

If we want a G bundle under a representation p then we further require that 
$ ap(p){f ) = p{gap{p)){f) as above and that the maps g a p themselves satisfy 
the cocycle condition: 


g aa (p ) = id for p G U a (19.1) 

9 ap(p) = gpa(p ) for p£U a nUp 
g a p{p) 0 5/37 (p) ° 57a (p) = id for p G U a n Up n U 1 

We shall also call the maps g a p transition functions or transition maps. 
Notice that if p is effective the last condition will be automatic. The family {U a } 
together with the maps form a co cycle and we can construct a bundle by 
taking the disjoint union |_|({ 7 q. x F) = (J U a x F x {a} and then taking the 
equivalence classes under the relation (p, /,/ 3 ) ^ (p, $ a/ g(p)(/), a) so that 

E = (|J U a x F x {a}) / - 


and 7 r([p, /,/?]) = p. 

Let H C G be a closed subgroup. Suppose that we can, by throwing out 
some of the elements of { U a ,& a } arrange that all of the transition functions 
live in H. That is, suppose we have that d ) ap(p)(f) = p(g<xp(p), f) where 
5 a/3 : U a fl Up —> H. Then we have a reduction the structure group (or 
reduction of the structure representation in case the action needs to be 
specified) . 

Next, suppose that we have an surjective Lie group homomorphism h : 
G — > G . We then have the lifted representation p : G x F — ► F given by 
p(g,f) = p(h(g),f). Under suitable topological conditions we may be able to 
lift the maps g a p to maps g a/3 : U a fl Up — > G and by choosing a subfamily 
we can even arrange that the g a p satisfy the cocycle condition. Note that 
dfo/3 (p){f) = p(gapj) = p{h{g a p),f) = p(g a p(p),f)- In this case we say that 
we have lifted the structure representation to p. 
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Example 19.1 The simplest class of examples of fiber bundles over a manifold 
M are the product bundles. These are just Cartesian products M x F together 
with the projection map pr\ : M x F — > M. Here, the structure group can be 
reduced to the trivial group {e} acting as the identity map on F. On the other 
hand, this bundle can also be prolonged to any Lie group acting on F. 

Example 19.2 A covering manifold n : M —> M is a G-bundle where G is the 
permutation group of the fiber which is a discrete (O-dimensional) Lie group. 

Example 19.3 (The Hopf Bundle) Identify S 1 as the group of complex num- 
bers of unit modulus. Also, we consider the sphere S 3 as it sits in C 2 : 

S 3 = {( zi,z 2 ) € C 2 : \z\\ + |^ 2 1 = 1}- 

The group S 1 acts on S 2 by u ■ ( z\,z 2 ) = (uzi,uz 2 ). Next we get S 2 into the 
act. We want to realize S 2 as the sphere of radius 1/2 in R 3 and having two 
coordinate maps coming from stereographic projection from the north and south 
poles onto copies of C embedded as planes tangent to the sphere at the two poles. 
The chart, transitions then have the form w = \/z. Thus we may view S 2 as 
two copies of C, say the z plane Ci and the w plane C 2 glued together under 
the identification (f> : z 1/ z = w 

S 2 = C,U 0 C 2 . 

With this in mind define a map tt : S 3 C C 2 — > S 2 by 

/ x _ f z 2 /z\ £ C 2 C Cl C 2 if Zi / 0 
^ e Ci c Ci U 0 c 2 if z 2 /o ■ 

Note that this gives a well defined map onto S 2 . 

Claim 19.1 u- (zi,z 2 ) = u- (w\,w 2 ) iff n(zi, z 2 ) = n(wi,w 2 ). 

Proof. If u ■ (zi, z 2 ) = u ■ (wi,w 2 ) and z 1 / 0 then w\ / 0 and n(wi,w 2 ) = 
w 2 /wi = uw 2 /uw\ = tt(wi,w 2 ) = 7t(z 1 ,z 2 ) = uz 2 /uzi = z 2 /z 1 = tt(z 1 ,z 2 ). A 
similar calculation show applies when z 2 / 0. On the other hand, if 7r(wi, w 2 ) = 
7r(zi, z 2 ) then by a similar chain of equalities we also easily get that u ■ (wi ,w 2 ) = 
... = 7r(W! ,W 2 ) = 7t(z 1 ,Z 2 ) = ... =U - (Z!,Z 2 ). ■ 

Using these facts we see that there is a fiber bundle atlas on irnopf = 7r : 
S 3 — > S' 2 given by the following trivializations: 

<pi : 7r _1 (Ci) -CixS 1 
<Pi ■ (zi,z 2 ) = (z 2 /z 1 ,z 1 / |zi|) 


<P2 : ^ _1 (C 2 ) ^C 2 xS 1 
Ti '■ (zi,z 2 ) = (zi/z 2 , z 2 / \z 2 \). 


and 
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The transition map is 

% 

( Z,U ) (1 /z, fru) 

M 

which is of the correct form since u h || • u is a circle action. Thus the Hopf 
bundle is an S' 1 — bundle with typical fiber S 1 itself. It can be shown that the 
inverse image of a circle on S 2 by the Hopf projection n Hopf is a torus. Since 
the sphere S 2 is foliated by circles degenerating at the poles we have a foliation 
of S 3 -{two circles} by tori degenerating to circles at the fiber over the two poles. 
Since S 3 \{pole} is diffeomorphic to R 3 we expect to be able to get a picture of 
this foliation by tori. In fact, the following picture depicts this foliation. 


19.2 Principal and Associated Bundles 

An important case for a bundle with structure group G is where the typical fiber 
is the group itself. In fact we may obtain such a bundle by taking the transition 
functions g a p from any effective G bundle E — > M or just any smooth maps 
9af3 : U a n Up — > G that form a cocycle with respect to some cover {U a } 
of M. We let G act on itself by left multiplication and then use the bundle 
construction method above. Thus if {U a } is the cover of M corresponding to 
the cocycle {g a /3} a ,0 then we let 

p = (U U a x G x {a}) / - 

where ( p,g,a ) ^ (p, g a p(p)g, /?) gives the equivalence relation. In this way we 
construct what is called the a principal bundle. Notice that for g £ G we have 
{p, g \,(3) ^ (p, <72, ct) if and only if (p, gig, j3) ^ ( p , <72 <7, ex) and so there is a well 
defined right action on any bundle principal bundle. On the other hand there is 
a more direct way chart free way to define the notion of principal bundle. The 
advantage of defining a principal bundle without explicit reference to transitions 
functions is that we may then use the principal bundle to give another definition 
of a G-bundle that doesn’t appeal directly to the notion of transition functions. 
We will see that every G bundle is given by a choice of a principal G-bundle 
and an action of G on some manifold F (the typical fiber). 

First we define the trivial principal G bundle over U to be the trivial 
bundle pr± : U x G — > M together with the right G action (U x G) x G given by 

(x,9i)g ■= (x,gig). 

An automorphism of the G-space U x G is a bundle map 5 : U x G — > U xG 
such that S(x, gig) = S(x, <71)77 for all <71, <7 £ G and all x £ U. Now 6 must have 
the form given by S(x,g) = (x, A(x, g)) and so 

A (x,g) = A (x,e)g. 

If we then let the function x 1— > A(x, e) be denoted by g$Q then we have 8(x, g) = 
(x, gs(x)g). Thus we obtain the following 



19.2. PRINCIPAL AND ASSOCIATED BUNDLES 


283 


Lemma 19.1 Every automorphism of a trivial principal G bundle over and 
open set U has the form 6 : ( x,g ) i— > (x,gs(x)g) for some smooth map g$ : U — > 

G. 

Definition 19.1 A principal G-bundle is a fiber bundle 7 rp : P — > M to- 
gether with a right G action P x G — > P which is locally equivalent as a right G 
space to the trivial principal G bundle over M. This means that for each point 
x £ M there is an open neighborhood U x and a trivialization 

t t- p \Ux) -£ U x xG 

\ / pn 

U x 

which is G equivariant. Thus we require that 4>{pg) = 4>(p)g. We shall call such 
a trivialization an equivariant trivialization. 

Note that <f>(pg) = (tt p(pg),&(pg)) while on the other hand <j>(p)g = ( 7 rp(pg),$(p)g) 
so it is necessary and sufficient that &(p)g = <h(jpg). Now we want to show that 
this means that the structure representation of 7Tp : P — » M is left multiplica- 
tion by elements of G. Let </>i, U± and <p 2 , U 2 be two equivariant trivializations 
such that C/i n C/ 2 ^ 0. On the overlap we have the diagram 

u 1 nu 2 xG th 7 rf 1 (u 1 nu 2 ) ^ u 1 nu 2 xG 

\ I / 

u 1 nu 2 

The map f >2 0 ff 1 \u 1 nUo c l ear ly must an G-bundle automorphism of U\ (~l U 2 x 
G and so by 19.1 must have the form ^2 0 ff 1 \ v nU2 (x, g) = (x, $12 (x)g). 

We conclude that a principal G-bundle is a G-bundle with typical fiber G as 
defined in section 19.1. The maps on overlaps such as <l>i 2 are the transition 
maps. Notice that $ 12 (x) acts on G by left multiplication and so the structure 
representation is left multiplication. 

Proposition 19.1 Ifnp : P M is a principal G-bundle then the right action 
P x G — > P is free and the action restricted to any fiber is transitive. 

Proof. Suppose p € P and pg = p for some g € G. Let 7rp 1 (f7 a; ) U x x G 
be an (equivariant) trivialization over U x where U x contains 7 Tp(p) — x. Then 
we have 


(x,gog) = (x,go) => 

9o9 = 9o 

and so g = e. 

Now let P x = 7Tp 1 (x) and let pi,P 2 £ P x . Again choosing an (equivariant) 
trivialization over U x as above we have that 4>{pi) = (x,gf) and so letting g := 
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g i 1 (?2 we have (j>(pig) = ( x,g\g ) = {x,g 2 ) = <f>{p- 2 ) and since </> is injective 
Pi9 = P2 ■ ■ 

The reader should realize that this result is in some sense “obvious” since 
the upshot is just that the result is true for the trivial principal bundle and 
then it follows for the general case since a general principal bundle is locally 
G-bundle isomorphic to a trivial principal bundle. 

Remark 19.2 Some authors define a principal bundle to be fiber bundle with 
typical fiber G and with a free right action that is transitive on each fiber. 

Our first and possibly most important example of a principal bundle is the 
frame bundle of a smooth manifold. The structure group is the general linear 
group GL(n, R). We define a frame at (or above) a point x G M to be a basis 
for the tangent space T X M. Let F X (M) be the set of all frames at x and define 

F(M) = |J F X (M). 

xeM 

Also, let ttf(m) '■ F(M) — > M be the natural projection map which takes any 
frame f x G F{M) with f x G F X (M) to its base x. We first give F(M) a smooth 
atlas. Let us adopt the convention that (f x )i '■= fi is the i-tli vector in the 
basis f a , = (/i , .. Let {U a ,il> a } ae A be an atlas for M. For each chart 
U a ,i> a — (z 1 ! x 1 ) on M shall define a chart FU a , Fxf a by letting 

FU a :=F-\ M) {U a )= |J F X (M) 

xeu a 

and 

FM f x ) := (. fij ) G R" xn 

where fij is the n 2 numbers such that (f x )j = J2 fj o-x' ./ ' ^ eave ^ t0 the 
reader to find the change of coordinate maps and see that they are smooth. The 
right GL(n,M) action on F(M) is given by matrix multiplication (f x ,g) i— > f x g 
where we think of f x as row of basis vectors. 



Chapter 20 

Analysis on Manifolds 


The best way to escape from a problem is to solve it. 
-Alan Saporta 


20.1 Basics 

20.1.1 Star Operator II 

The definitions and basic algebraic results concerning the star operator on a 
scalar product space globalize to the tangent bundle of a Riemannian manifold 
in a straight forward way. 

Definition 20.1 Let M, g be a semi- Riemannian manifold. Each tangent space 
is a scalar product space and so on each tangent space T p M we have a metric 
volume element vol p and then the map p i— > vol p gives a section of /\ n T*M 
called the metric volume element of M, g. Also on each fiber f\T*M of /\T*M 
we have a star operator * p : /\ k T*M — > /\" k T*M. These induce a bundle 
map * : f\ k T*M — * f\ n k T*M and thus a map on sections (i.e. smooth forms) 

*:n k (M)-*n n ~ k (M). 

Definition 20.2 The star operator is sometimes referred to as the Hodge star 
operator. 

Definition 20.3 Globalizing the scalar product on the Grassmann algebra we 
get a scalar product bundle where for every T),w £ Qr(M ) we 

have a smooth function (g, oj) defined by 

V l— > (??(pWp)) 

and thus a C 00 (M ) -bilinear map (., .) : x O k (M) —> C°°(M). Declaring 

forms of differing degree to be orthogonal as before we extend to a C°° bilinear 
map (., .) : fi(M) x f 2(M) -> C°°(M). 
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Theorem 20.1 For any forms rj,u) £ O(M) we have ( rj,L 0 ) vol = g A *tu 

Now let M, g be a Riemannian manifold so that g =(., .) is positive definite. 
We can then define a Hilbert space of square integrable differential forms: 

Definition 20.4 Let an inner product be defined on f 1 C (M), the elements of 
f l(M) with compact support, by 

:= / 77 A *lo = / ( 77 , 0 ;) vol 

Jm Jm 

and let L 2 (Ll(M)) denote the L 2 completion of O c (M) with respect to this inner 
product. 

20.1.2 Divergence, Gradient, Curl 

20.2 The Laplace Operator 

The exterior derivative operator d : 0 k (M) — » f 1 k+1 (M) has a formal adjoint 
6 : 0 k+1 (M) — y O k (A I) defined by the requirement that for all a, (3 £ fl k (M) 
with compact support we have 

(da, (3) = (a, 5(3). 

On a Riemannian manifold M the Laplacian of a function / € C(M) is given 
in coordinates by 


A / 


1 

V9 


J2 d j(9 jk V9d k f) 

j,k 


where g lJ is the inverse of gij the metric tensor and g is the determinant of the 
matrix G = (gij). We can obtain a coordinate free definition as follows. First 
we recall that the divergence of a vector field X £ X(M) is given at p £ M by 
the trace of the map XX\ T ^ M . Here XX \ T M is the map 


v 1— > V V X. 


Thus 

div(X)(p) := tr(VX\ TpM ). 

Then we have 

A/ := div(grad(f)) 

Eigenvalue problem: For a given compact Riemannian manifold M one is 
interested in finding all A £ R such that there exists a function f ^ 0 in spec- 
ified subspace S C L 2 (M) satisfying A / = A / together with certain boundary 
conditions in case dM ^ 0. 

The reader may be a little annoyed that we have not specified S more clearly. 
The reason for this is twofold. First, the theory will work even for relatively 
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compact open submanifolds with rather unruly topological boundary and so 
regularity at the boundary becomes and issue. In general, our choice of S will 
be influenced by boundary conditions. Second, even though it may appear 
that S must consist of C 1 2 functions, we may also seek “weak solutions” by 
extending A in some way. In fact, A is essentially self adjoint in the sense that 
it has a unique extension to a self adjoint unbounded operator in L 2 (M) and 
so eigenvalue problems could be formulated in this functional analytic setting. 
It turns out that under very general conditions on the form of the boundary 
conditions, the solutions in this more general setting turn out to be smooth 
functions. This is the result of the general theory of elliptic regularity. 

Definition 20.5 A boundary operator is a linear map b : S — ► C°(dM). 

Using this notion of a boundary operator we can specify boundary conditions 
as the requirement that the solutions lie in the kernel of the boundary map. In 
fact, the whole eigenvalue problem can be formulated as the search for A such 
that the linear map 


(A - A) © b : S -► L 2 (M ) © C°(dM) 

has a nontrivial kernel. If we find such a A then this kernel is denoted E\ C 
L 2 (Af) and by definition A / = A / and bf = 0 for all / £ E\. Such a function 
is called an eigenfunction corresponding to the eigenvalue A. We shall see 
below that in each case of interest (for compact M) the eigenspaces E\ will be 
finite dimensional and the eigenvalues form a sequence of nonnegative numbers 
increasing without bound. The dimension dim(£A) is called the multiplicity 
of A. We shall present the sequence of eigenvalues in two ways: 

1. If we write the sequence so as to include repetitions according to multiplic- 
ity then the eigenvalues are written as 0 < Ai < A 2 < . . . | 00 . Thus it is 
possible, for example, that we might have A 2 = A 3 = A 4 if dim(£A 2 ) = 3. 

2. If we wish to list the eigenvalues without repetition then we use an overbar: 

0 < Ai < A 2 < . . . t 00 


The sequence of eigenvalues is sometimes called the spectrum of M. 

To make thing more precise we divide things up into four cases: 

The closed eigenvalue problem: In this case M is a compact Riemannian 
manifold without boundary the specified subspace of L 2 (M) can be taken to be 
C 2 (M). The kernel of the map A — A : C 2 (M) — > C°(M) is the A eigenspace 
and denoted by E\ It consists of eigenfunctions for the eigenvalue A. 

The Dirichlet eigenvalue problem: In this case M is a compact Rie- 

O 

mannian manifold without nonempty boundary dM . Let M denote the interior 

O 

of M. The specified subspace of L 2 {M) can be taken to be C 2 (M)r\C°(M) and 
the boundary conditions are / dM = 0 (Dirichlet boundary conditions) 
so the appropriate boundary operator is the restriction map bn '■ f ' — » f\ dM. 
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The solutions are called Dirichlet eigenfunctions and the corresponding se- 
quence of numbers A for which a nontrivial solution exists is called the Dirichlet 
spectrum of M. 

The Neumann eigenvalue problem: In this case M is a compact Rie- 
mannian manifold without nonempty boundary dM but . The specified sub- 

O 

space of L 2 {AI) can be taken to be C 2 {AI) D C 1 (M). The problem is to find 

O 

nontrivial solutions of A / = A / with / G C 2 (M) n C°(dM) which satisfy 
vf\dM = 0 (Neumann boundary conditions). Thus the boundary map 
here is : C 1 (M) — > C°(dM) given by / i— > vf\dM where v is a smooth 
unit normal vector field defined on dM and so the vf is the normal derivative 
of / . The solutions are called Neumann eigenfunctions and the correspond- 
ing sequence of numbers A for which a nontrivial solution exists is called the 
Neumann spectrum of M. 

Recall that the completion of C k (M) (for any k > 0) with respect to the 
inner product 

(/,<?)= [ fgdv 

J M 

is the Hilbert space L 2 (AI). The Laplace operator has a natural extension 
to a self adjoint operator on L 2 (M) and a careful reformulation of the above 
eigenvalue problems in this Hilbert space setting together with the theory of 
elliptic regularity lead to the following 

Theorem 20.2 1 ) For each of the above eigenvalue problems the set of eigen- 
values (the spectrum) is a sequence of nonnegative numbers which increases 
without bound: 0 < Ai < A2 < • • • T 00. 

O 

2) Each eigenfunction is a C°° function on M = MU dM. 

3) Each eigenspace E (or E d or E^ ) is finite dimensional, that is, each 
eigenvalue has finite multiplicity. 

4) If (p^. r, mi is an orthonormal basis for the eigenspace E^_ (or E® 

or E^) then the set B = is a complete orthonormal set for 

L 2 (M). In particular, if we write the spectrum with repetitions by multiplic- 
ity, 0 < Ai < A2 < . . . T 00 , then we can reindex this set of functions B as 

^ 2 ) <^ 3 ) •••} to obtain an ordered orthonormal basis for L 2 (AI) such that g)i 
is an eigenfunction for the eigenvalue A; . 


The above can be given the following physical interpretation. If we think 
of AI as a vibrating homogeneous membrane then the transverse motion of the 
membrane is described by a function / : M x (0,oo) — > R satisfying 


d 2 f 


and if dM ^ 0 then we could require f\dM x (0,oo) = 0 which means that we 
are holding the boundary fixed. A similar discussion for the Neumann boundary 
conditions is also possible and in this case the membrane is free at the boundary. 
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If we look for the solutions of the form f(x,t ) = <j>(x)T(t) then we are led to 
conclude that <f> must satisfy A <f> = A <f> for some real number A with <f> = 0 on 
dM. This is the Dirichlet eigenvalue problem discussed above. 

Theorem 20.3 For each of the eigenvalue problems defined above 

Now explicit solutions of the above eigenvalue problems are very difficult to 
obtain except in the simplest of cases. It is interesting therefore, to see if one 
can tell something about the eigenvalues from the geometry of the manifold. 
For instance we may be interested in finding upper and/or lower bounds on 
the eigenvalues of the manifold in terms of various geometric attributes of the 
manifold. A famous example of this is the Faber Krahn inequality which states 
that if Q is a regular domain in say R" and D is a ball or disk of the same 
volume then 


A(ft) > X(D) 

where A(f2) and A (D) are the lowest nonzero Dirichlet eigenvalues of O and D 
respectively. Now it is of interest to ask whether one can obtain geometric infor- 
mation about the manifold given a degree of knowledge about the eigenvalues. 
There is a 1966 paper by M. Kac entitled “Can One Hear the Shape of a Drum?” 
which addresses this question. Kac points out that Weyl’s asymptotic formula 
shows that the sequence of eigenvalues does in fact determine the volume of the 
manifold. Weyl’s formula is 


(^fc) 


1/2 


( 2 ttY 


vol(M) 


as k 


where u> n is the volume of the unit ball in R™ and M is the given compact 
manifold. In particular, 


(2tt)" \ k 

UJn Jkloo( A fc )™ /2 


vol(M). 


So the volume is indeed determined by the spectrum 1 . 


20.3 Spectral Geometry 

20.4 Hodge Theory 

20.5 Dirac Operator 

It is often convenient to consider the differential operator D = i-j^ instead of 
even when one is interested mainly in real valued functions. For one thing 

1 Notice however, one may ask still how far out into the spectrum must one “listen” in 
order to gain an estimate of vol(M) to a given accuracy. 
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n a2 

D = an< l so D provides a sort of square root of the positive Euclidean 

o2 

Laplacian A = —-§^2 hr dimension 1. Dirac wanted a similar square root for the 
wave operator □ = dg — (the Laplacian in R 4 for the Minkowski inner 

metric) and found that an operator of the form D = d 0 — 7 A would do 

the job if it could be arranged that 7*7^ + 7^7 = 2 r/ij where 


(Vij) = 


1 0 

0 -1 

0 0 

0 0 


0 0 

0 0 

-1 0 

0 -1 


One way to do this is to allow the 7 , to be matrices. 

Now lets consider finding a square root for A = — df. We accomplish 
this by an R- linear embedding of R" into an N x N real or complex matrix 
algebra A by using n linearly independent matrices {7,: : i = 1,2, ...,n} ( so 
called “gamma matrices”) and mapping 

(a; 1 , ..., x n ) 1— > ai*7 i (sum). 

and where 7-1 , ..... 7„ are matrices satisfying the basic condition 

lilj + lilj = -2 6ij. 

We will be able to arrange 2 that {1, 71, ...., 7„} generates an algebra of dimension 
2” and which is spanned as vector space by the identity matrix 1 and all products 
of the form 7q • • • 7 \ k with i\ < *2 < • • • < ik- Thus we aim to identify R" with 
the linear span of these gamma matrices. Now if we can find matrices with the 
property that 7^7 + 7*7^ = —2 Sij then our “Dirac operator” will be 


n 

D = Y, 7 jdj 

i—1 

which is now acting on iV-tuples of smooth functions. 

Now the question arises: What are the differential operators di = acting 
on exactly. The answer is that they act on whatever we take the algebra spanned 
by the gamma matrices to be acting on. In other words we should have some 
vector space S which is a module over the algebra spanned by the gamma 
matrices. Then we take as our “fields” smooth maps / : R" — > S. Of course 
since the 7* £ Mjvxiv we may always take S = R w with the usual action of 
MatxJV on R w . The next example shows that there are other possibilities. 


2 It is possible that gamma matrices might span a space of half the dimension we are 
interested in. This fact has gone unnoticed in some of the literature. The dimension condition 
is to assure that we get a universal Clifford algebra. 
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Example 20.1 Notice that with ^ := and : = we have 


2 

i o 

0 

_d_d_ 

dz dz 


d 1 r 


0 - 
_9_ 

dz 

JLJL 

dz dz 

0 


A 1 

dz 

0 



' A 

0 


0 

A 


where A = — < 9 / 


= A 1 

On the other hand 


0 

A 

dz 


_A 1 

dz 

0 


0 -1 

1 0 


d_ 

dx 


0 i 
-i 0 


d_ 

dy' 


From this we can see that appropriate gamma matrices for this case are 71 = 


-1 0 

0 1 


and 72 


/ 0 
0 -i 


Now let E° be the span of 1 = ^ ^ ^ ^ and 7271 


-i 0 

0 -i 


Let 


E 1 be the span of 71 and 72 • Refer to E° and E 1 the even and odd parts of 


Span{ 1 , 71, 72) 727i}- Then we have that D — 


n dz 

L 5=z ° 


maps E° to E 1 


and writing a typical element of E° as f(x,y) = u(x,y) + 7271 v(x,y) is easy 
to show that Df = 0 is equivalent to the Cauchy -Riemann equations. 


The reader should keep this last example in mind as this kind of decompo- 
sition into even and odd part will be a general phenomenon below. 


20.5.1 Clifford Algebras 

A Clifford algebra is the type of algebraic object that allows us to find differential 
operators which square to give Laplace type operators. The matrix approach 
described above is in fact quite general but there are other approaches which are 
more abstract and encourage one to think about a Clifford algebra as something 
that contains the scalar and the vector space we start with. The idea is similar 
to that of the complex numbers. We seldom think about complex numbers as 
“pairs of real numbers” while we are calculating unless push comes to shove. 
After all, there are other good ways to represent complex numbers; as matrices 
for example. And yet there is one underlying abstract object called the complex 
numbers which ironically is quite concrete once one get used to using them. 
Similarly we encourage the reader to learn to think about abstract Clifford 
algebras in the same way. Just compute! 

Clifford algebras are usually introduced in connection with a quadratic form 
q on some vector space but in fact we are just as interested in the associated 
symmetric bilinear form and so in this section we will generically use the same 
symbol for a quadratic form and the bilinear form obtained by polarization and 
write both q(v) and q(v,w). 
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Definition 20.6 Let V be an n dimensional vector space over a field K with 
characteristic not equal to 2. Suppose that q is a quadratic form on V and let q 
be the associated symmetric bilinear form obtained by polarization. A Clifford 
algebra based on Y,q is an algebra with unity 1 C7(V,g, IK) containing V (or 
an isomorphic image of V) such that the following relations hold: 

vw + wv = —2 q(v, tu)l 

and such that Cl(Y,q, IK) is universal in the following sense: Given any linear 
map L : V — > A into an associative K —algebra with unity 1 such that 

L{v)L{w) + L(w)L(v) = — 2q(v,w)l 

then there is a unique extension of L to an algebra homomorphism L : Cl(V,q,K) 

A. 


If ei,...,e n is an orthonormal basis for V, q then we must have 

e, ej + ejei = 0 for i ^ j 

e? = —q(ei) = ±1 or 0 

A common choice is the case when q is a nondegenerate inner product on a real 
vector space. In this case we have a particular realization of the Clifford algebra 
obtained by introducing a new product into the Grassmann vector space AV. 
The said product is the unique linear extension of the following rule for v £ A 1 V 
and w € A fe V: 


v ■ w := v A w — v° a w 
w ■ v := (— l) fc (u A w + v b j w) 

We will refer to this as a geometric algebra on AV and this version of the 
Clifford algebra will be called the form presentation of Cl(V,q). Now once 
we have a definite inner product on V we have an inner product on V* and 
V = V*. The Clifford algebra on V* is generated by the following more natural 
looking formulas 


a ■ (3 := a A (3 — (jja) j/3 
/?•<*:= (— l) fc (aA /?+(#<*)_■/?) 

for a e A 1 V and /? € AV. 

Now we have seen that one can turn AV (or AV* ) into a Clifford algebra and 
we have also seen that one can obtain a Clifford algebra whenever appropriate 
gamma matrices can be found. A slightly more abstract construction is also 
common: Denote by I(q) the ideal of the full tensor algebra T(V) generated 
by elements of the form x <S> x — q(x) ■ 1. The Clifford algebra is (up to 
isomorphism) given by 


Cl(V,q,K) = T(V)/I(q). 
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We can use the canonical injection 

* : V — * C K 

to identify V with its image in ClfV.q, IK. (The map turns out that i is 1-1 onto 
i(V) and we will just accept this without proof.) 

Exercise 20.1 Use the universal property ofCl(V,q, K) to show that it is unique 
up to isomorphism. 

Remark 20.1 Because of the form realization of a Clifford algebra we see that 
AV is a Cl(V , q,M.)— module. But even if we just have some abstract Cl(V,q,M) 
we can use the universal property to extend the action of V on AV given by 

v v ■ w := v A w — v” jw 

to an action of Cl(V, q, IK) on AV thus making AV a Cl(V , q,M)— module. 

Definition 20.7 Let R™ r s) be the real vector space R n with the inner product 
of signature (r, s) given by 


r r-\-s—n 

(x,y) := y ^Xiyi - Y Xipi. 

i—1 i=r -\- 1 

The Clifford algebra formed from this inner product space is denoted Cl r ^ s . In 
the special case of ( p , q) = (n, 0) we write Cl n . 

Definition 20.8 Let C n be the complex vector space of n-tuples of complex 
numbers together with the standard symmetric C— bilinear form 

n 

b(z,w) := y^ ZjWj. 

i = 1 

The (complex) Clifford algebra obtained is denoted Cl n . 

Remark 20.2 The complex Clifford algebra Cl n is based on a complex symmet- 
ric form and not on a Hermitian form. 

Exercise 20.2 Show that for any nonnegative integers p, q with p + q = n we 
have Cl Piq (g> C = C l n . 

Example 20.2 The Clifford algebra based on R 1 itself with the relation x 2 = —1 
is just the complex number system. 

The Clifford algebra construction can be globalized in the obvious way. In 
particular, we have the option of using the form presentation so that the above 
formulas a ■ (3 := a A /3 — (jJa)_i/3 and (3 ■ a := (— l) fc (a A (3 + (j)a)j (3) are 
interpreted as equations for differential forms a £ A 1 T*M and (3 € A k T*M on 
a semi-Riemannian manifold M, g. In any case we have the following 
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Definition 20.9 Given a Riemannian manifold M, g. the Clifford algebra bun- 
dle is Cl(T*M, g) = Cl(T*M) := U x Cl(T*M). 

Since we take each tangent space to be embedded TfMc Cl(T*M), the 
elements 9 l of a local orthonormal frame 9\ ....,9 n £ S2 1 are also local sections 
of Cl(T*M, g) and satisfy 

O' O' + 0 J 0' = W'.O 1 ) = AS 1 ' 

Recall that e 1 ,...,£ n is a list of numbers equal to ±1 (or even 0 if we allow 
degeneracy) and giving the index of the metric g(., .) = (., .). 

Obviously, we could also work with the bundle Cl(TM) := U x Cl(T x M) 
which is naturally isomorphic to Cl(T*M) in which case we would have 

eV +eV = ~(e i ,e j ) = 

for orthonormal frames. Of course it shouldn’t make any difference to our 
development since one can just identify TM with T*M by using the metric. On 
the other hand, we could define Cl(T*M, b) even if b is a degenerate bilinear 
tensor and then we recover the Grassmann algebra bundle A T*AI in case b = 0. 
These comments should make it clear that Cl(T*M,g) is in general a sort of 
deformation of the Grassmann algebra bundle. 

There are a couple of things to notice about Cl{T*M) when we realize it as 
A T*M with a new product. First of all if a, (3 £ A T*M and (a, (3) = 0 then 
a • P = a A /3 where as if (a, P) 0 then in general a/3 is not a homogeneous 
element. Second, Cl(T*M) is locally generated by {1} U {9 1 } U { 9 l 9i : i < 
j} U • • • U { 9 1 9 2 • • • 9 n } where 9 1 ,9 2 , ..., 9 n is a local orthonormal frame. Now we 
can immediately define our current objects of interest: 

Definition 20.10 A bundle of modules over Cl(T*M ) is a vector bundle 
E= (. E,tt,M ) such that each fiber E x is a module over the algebra CKTfM) and 
such that for each 9 £ T(Cl(T* M)) and each a £ T(X) the map x i— > 9(x)a(x) 
is smooth. Thus we have an induced map on smooth sections: Y(Cl(T* M)) x 

r(E)-,r(E). 

Proposition 20.1 The bundle Cl(T*M ) is a Clifford module over itself and 
the Levi Civita connection V on M induces a connection on Cl(T*M) (this 
connection is also denoted V ) such that 

V (cr i rr 2 ) = (Vcri)cr 2 + OiV cr 2 

for all o\, 02 £ T(Cl(T*M)). In particular, ifX,Y £ X(M ) ando £ T(Cl(T*M)) 
then 

V x (Yo ) = (V x Y)o + YV x a. 

Proof. Realize Cl(T*M ) as A T*M with Clifford multiplication and let V 
be usual induced connection on A T*M C ®T*M. We have for an local o.n 
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frame ei, e n with dual frame Q 1 , 0 2 , 9 n . Then Wed 1 = — T* 

Ve(0 i 0 J ') = V((0 i A0 3 ') 

= V^0‘ A + e i A 

= -ri(f)o k a ^ ^ r J k (f)e k a 0 * 

= -r k (t)e k e j - v{(C)6 k o l = (y^)e J + 

The result follows by linearity and a simple induction since a general section a 
can be written locally as a = J2 a hi 2 -..ik 9 ll 9 12 ■ ■ ■ 9 lk . m 

Definition 20.11 Let M, g be a (semi-) Riemannian manifold. A compatible 
connection for a bundle of modules E over Cl(T* M) is a connection V s on 
E such that 

V s (<7 • s) = (Vcr) • S + <7 ■ V s s 
for all s € T(E) and all o € T(Cl(T*M )) 

Definition 20.12 Let E = ( E,n,M ) be a bundle of modules over Cl(T*M) 
with a compatible connection V = V s . The associated Dirac operator is 
defined as a differential operator E on by 

Ds:=Y. di - V l s 

for s G T(E). 

Notice that Clifford multiplication of Cl(T*M) on E = M) is a zeroth 
order operator and so is well defined as a hberwise operation Cl(T*M) x E x — > 

E x - 

There are still a couple of convenient properties that we would like to have. 
These are captured in the next definition. 

Definition 20.13 Let E = ( E,n,M ) be a bundle of modules over Cl(T*M) 
such that E carries a Riemannian metric and compatible connection V = V s . 
We call E = ( E,tt,M ) a Dirac bundle if the following equivalent conditions 
hold: 

1 ) {esi^esf) = (s\, sf) for all si,S 2 G E x and alle £ T*M c Cl(T*M)) with 
|e| = 1. In other words, Clifford multiplication by a unit (co)vector is required 
to be an isometry of the Riemannian metric on each fiber of E. Since , e 2 = — 1 
it follows that this is equivalent to requiring . 

2) (esi,S 2 ) = ~ (si ; es 2 ) for all si,S 2 G E x and all e G TfM C Cl(T*M )) 
with | e | = 1. 

Assume in the sequel that q is nondegenerate. Let denote the subalgebra 
generated by all elements of the form x\ ■ ■ ■ Xk with k even. And similarly, 
Cli(V,q), with k odd. Thus Cl(V,q) has the structure of a Z 2 — graded algebra 
(also called a superalgebra) : 


Cl(V,q) =Cl 0 (V,q)®Ch(V,q) 
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Cl 0 (V,q) ■ Cl 0 (V,q) C C7 0 (V ,q) 

Cl 0 {Y,q)-Ch{Y,q) C Ch(Y,q) 

Ch(y,q)-Ch(V,q) C Cl 0 {V,q) 

Clo(V,q) and Cli(V,q ) are referred to as the even and odd part respectively. 
There exists a fundamental automorphism a. of Cl(V,q) such that a(x) = — x 
for all x eV. Note that a 2 = id. It is easy to see that Clo(V,q) and Cli(Y,q ) 
are the +1 and —1 eigenspaces of a : Cl(V,q ) — > Cl(V,q). 

20.5.2 The Clifford group and Spinor group 

Let G be the group of all invertible elements s € Ck such that sVs -1 =V. 
This is called the Clifford group associated to q. The special Clifford group is 
G + = G fl Co- Now for every s £ G we have a map <j> s : v — > svs for v GV. 

It can be shown that <f> is a map from G into 0{q), the orthogonal group of q. 
The kernel is the invertible elements in the center of Cu- 
lt is a useful and important fact that if x € GflV then q{x) ^ 0 and — (j) x is 
reflection through the hyperplane orthogonal to x. Also, if s is in G + then <fi s 
is in SO(q). In fact, </>(G + ) = SO{q). 

Besides the fundamental automorphism a mentioned above, there is also a 
fundament anti-automorphism or reversion (3 : Cl(V,q) — > Cl(V,q) which is de- 
termined by the requirement that /3(v\V2 ■ ■ ■ Vk) = ■ ■ ■ v\ for V\,V 2 , ---,Vk GVC 

Cl(V,q). We can use this anti-automorphism (3 to put a kind of “norm” on G + ; 

IV; G+ — * K* 

where K* is the multiplicative group of nonzero elements of K and N(s) = (3(s)s. 
This is a homomorphism and if we “mod out” the kernel of N we get the so 
called reduced Clifford group Gj. 

We now specialize to the real case K = K. The identity component of G[ J" is 
called the spin group and is denoted by Spin(V, g). 


20.6 The Structure of Clifford Algebras 

Now if IK = K and 

r r+s 

q(x) = _ 

2=1 i=r -\- 1 

we write G(K r+s , q, K) = Cl(r,s). Then one can prove the following isomor- 
phisms. 

Cl(r + 1, s + 1) = Cl( 1, 1) ® G(r, s) 

Cl(s + 2, r) S Cl{ 2, 0) ® Cl(r, s ) 

Cl{s , r + 2) = Cl{ 0, 2) ® Cl{r , s) 
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Cl(p,p)^(g)Cl( 1,1) 

P 

Clip + k,p ) - 0 Cl(l, 1) 0 C7(fc, 0) 

Cl(k , 0) £* C7(2, 0) <g> C7(0, 2 ) <g> C7(fc - 4, 0) k > 4 
Using the above type of periodicity relations together with 
CZ(2,0) “ CZ(1,1) “ M 2 (R) 

Cl( 1,0) = R ® R. 

and 

C7(0,l) =C 

we can piece together the structure of Cl(r, s ) in terms of familiar matrix alge- 
bras. We leave out the resulting table since for one thing we are more interested 
in the simpler complex case. Also, we will explore a different softer approach 
below. 

The complex case . In the complex case we have a much simpler set of 
relations; 

Cl{2r) * Cl(r,r ) <g> C ^M 2 r(C) 

Cl(2r + 1) S Cl(l) 0 Cl(2r) 

2 Cl(2r) © Cl(2r) SS M 2 .(C)®M 2 r(C) 

These relations remind us that we may use matrices to represent our Clifford 
algebras. Lets return to this approach and explore a bit. 


20.6.1 Gamma Matrices 


Definition 20.14 A set of real or complex matrices 71 , 72 , ..., 7 n are called 
gamma matrices for Cl(r, s ) if 


TiTj + lili = ~ 2 9ij 

where {gij) = diag( 1 , ..., 1 , — 1 , ..., ) is the diagonalized matrix of signature (r, s). 


Example 20.3 Recall the Pauli matrices: 


cr 0 = 


1 

0 



o 1 = 


0 

1 



ct 2 



03 


1 0 
0 -1 


It is easy to see that <ti,<t 2 serve as gamma matrices for 01(0,2) while ia\,ia 2 
serve as gamma matrices for Cl{ 2,0). 

Cl( 2,0) is spanned as a vector space of matrices by <to, icr\, ia 2 , ids and is 
(algebra) isomorphic to the quaternion algebra H under the identification 


°o l— > 1 
ia 1 1 — > / 
ia 2 1 — > J 
*<73 1 — > K 
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Example 20.4 The matrices <7o , 0 i , 02 , 03 are gamma matrices for Cl{ 1,3). 

20.7 Clifford Algebra Structure and Represen- 
tation 

20.7.1 Bilinear Forms 

We will need some basic facts about bilinear forms. We review this here. 

(1) Let E be a module over a commutative ring R. Typically E is a vector 
space over a field IK. A bilinear map g :ExE — > R is called symmetric if 
g(x,y) = g(y,x) and antisymmetric if g(x,y) = —g(y,x) for (x,y) GExE. If 
R has an automorphism of order two, a i— > a we say that g is Hermitian if 
g(ax,y) = ag(x,y) and g(x,ay) = ag(x,y) for all a £ R and (x,y) GExE. If g 
is any of symmetric, antisymmetric, or Hermitian then the “ left kernel” of g is 
equal to the “right kernel” . That is 


ker g = {x € E : g(x, y) = 0 My G E} 

= {y G E : g(x, y) = 0 Mx € E} 

If ker g = 0 we say that g is nondegenerate. In case E is a vector space of 
finite dimension g is nondegenerate iff x i— > g{x, •) GE* is an isomorphism. An 
orthogonal basis for g is a basis {vi} for E such that g(vi, Vi) = 0 for i j. 

Definition 20.15 Let E be a vector space over a three types above. If E=E(BE 2 

for subspaces Ei C E and g{x\,X 2 ) = 0 Vaq GE,x 2 G-Eh then we write 

E = Ei _L E 2 

and say that E is the orthogonal direct sum of E\ and E 2 ■ 

Proposition 20.2 Suppose E,g is as above with 


E = Ei JL E 2 T • • • T Efc 

Then g is non- degenerate iff its restrictions g |e, are and 

kerE = E° _L E° i • • • T E^. 


Proof. Nearly obvious. ■ 

Terminology: If g is one of symmetric, antisymmetric or Hermitian we say 
that g is geometric. 


Proposition 20.3 Let g be a geometric bilinear form on a vector space E (over 
Kj. Suppose g is nondegenerate. Then g is nondegenerate on a subspace F iff 
E= F _L F 1 - where 


F i = {ieE:j(i,/) = 0 V/ef} 
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Definition 20.16 A map q is called quadratic iff there is a symmetric g such 
that q(x) = g(x,x). Note that g can he recovered from q: 

2 g(x, y ) = q(x + y)~ q(x) - q(y) 

20.7.2 Hyperbolic Spaces And Witt Decomposition 

E, g is a vector space with symmetric form g. If E has dimension 2 we call E 
a hyperbolic plane. If dimE> 2 and E=Ei _L E 2 JL ■ ■ ■ XE^ where each Ej is a 
hyperbolic plane for g |e ; then we call E a hyperbolic space. For a hyperbolic 
plane one can easily construct a basis /i ,/ 2 such that g(fi,f) = g{f-2, f2) = 0 
and g(fi, / 2 ) = 1. So that with respect to this basis g is given by the matrix 

'0 1 ' 

1 0 

This pair { /i , /a } is called a hyperbolic pair for E, g. Now we return to dimE> 2. 
Let radF = F 1 - fl F = kerg|^ 

Lemma 20.1 There exists a subspace U C E such that E= radEL U and U is 
nondegenerate. 

Proof. It is not to hard to see that rad U = rad /7 _L . If rad/7 = 0 then 
rad/7 - 1 = 0 and visa vera. Now U+U 1 - is clearly direct since 0 = rad/7 = /7n/7 _L . 
Thus E= U _L /7 _l . ■ 

Lemma 20.2 Let g be nondegenerate and U C E some subspace. Suppose that 
U = radU T W where radW = 0. Then given a basis {iti,-- - ,w s } for radU 
there exists Vi,--- ,v s £ W 1 - such that each is a hyperbolic pair. Let 

Pi = span{ui,Vi} . Then 

E = W 1 Pi i ■ ■ • 1 P s . 

Proof. Let W\ = span{u 2 , W 3 , • • • ,u s } © W. Then 1T| C rad/7 © IT so 
(rad/7 © IT )- 1 C W±. Let w\ G but assume W\ £ (rad/7 © W )~ L . Then we 
have g(ui,wi) 0 so that P\ = spanjui, wu} is a hyperbolic plane. Thus we 
can find v\ such that u± , Vi is a hyperbolic pair for Pi . We also have 

r, = (w 2 .u 3 - ••»„■) .t Pi 7 ir 

so we can proceed inductively since U 2 ,Us, . . .u s G rad/7i. ■ 

Definition 20.17 A subspace U C E is called totally isotropic if g\u = 0. 

Proposition 20.4 (Witt decomposition) Suppose that U CE is a maximal to- 
tally isotropic subspace and ej, e 2,...e r a basis for U. Then there exist (null) 
vectors f±, / 2 , . . . , f r such that each {e*, ff} is a hyperbolic pair and U' = span{f{\ 
is totally isotropic. Further 

E = U ®U' LG 

where G = (U © U') 1 - . 
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Proof. Using the proof of the previous theorem we have radfj = U and 
W = 0. The present theorem now follows. ■ 

Proposition 20.5 If g is symmetric then g\c is definite. 

EXAMPLE: Let E, g = C 2k , g 0 where 


2 k 

9o(z,w) = Y.ZiWi 

i— 1 


Let {ei, ..., efc, efc + i, ..., e 2 fc} be the standard basis of C 2k . Define 


Sj 'j-Xe.j T j 1, A: 


and 

V j = ~ * efe +i)' 

Then letting F = span{ei} , F' = span{? 7 j} we have C 2k = F © F' and F is a 
maximally isotropic subspace. Also, each {e, , rjj } is a hyperbolic pair. 

This is the most important example of a neutral space: 


Proposition 20.6 A vector space E with quadratic form is called neutral if the 
rank, that is, the dimension of a totally isotropic subspace, is r = dimE/2. The 
resulting decomposition F ® F' is called a (weak) polarization. 


20.7.3 Witt’s Decomposition and Clifford Algebras 

Even Dimension Suppose that V, Q is quadratic space over K. Let dimV= r and 
suppose that V, Q is neutral. Then we have that Ck is isomorphic to End (S') for 
an r dimensional space S (spinor space) . In particular, Ck is a simple algebra. 

Proof. Let F@F' be a polarization of V. Here, F and F' are maximal totally 
isotropic subspaces of V. Now let {xi, ..., x r , y%, ..., y r } be a basis for V such that 
{xi} is a basis for F and {yf\ a basis for F' . Set / = y\y% • • -yn- Now let S be 
the span of elements of the form x,;, x l2 ■ ■ ■ x-i r f where 1 < ii < ... < ih < r. S 
is an ideal of Ck of dimension 2 r . We define a representation p of Ck in S by 

p(u)s = us 

This can be shown to be irreducible so that we have the desired result. ■ 

Now since we are interested in spin which sits inside and in fact generates 
Co we need the following 

Proposition 20.7 Co is isomorphic to End(S + ) x End(S~) where S + = Co<lS 
and S~ = Ci fl S'. 
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This follows from the obvious fact that each of Co/ and Cif are invariant 
under multiplication by Cq. 

Now consider a real quadratic space V, Q where Q is positive definite. We 
have Spin(n) C Cl°( 0) C Co and Spin(?i) generates Co- Thus the complex spin 
group representation of is just given by restriction and is semisimple factoring 
as S + © S~ . 

Odd Dimension In the odd dimensional case we can not expect to find a 
polarization but this cloud turns out to have a silver lining. Let Xq be a non- 
isotropic vector from V and set Vi = (ajo) -1 - On Vi we define a quadratic form 

Qi by 

Qi(y) = -Q(xo)Q(y) 

for y eVi. It can be shown that Q i is non-degenerate. Now notice that for 
y eVi then x^y = —yx o and further 

{x 0 y) 2 = - x\y 2 = -Q{x 0 )Q(y) = Qi(y) 

so that by the universal mapping property the map 

y — * x 0 y 

can be extended to an algebra morphism h from Cl(Qi,Vi) to Cr- Now these 
two algebras have the same dimension and since C 0 is simple it must be an 
isomorphism. Now if Q has rank r then Q i,Vi is neutral and we obtain the 
following 

Theorem 20.4 If the dimension of V is odd and Q has rank r then Co is 
represented irreducibly in a space S + of dimension 2 r . In particular Co = 
End(S+). 

20.7.4 The Chirality operator 

Let V be a Euclidean vector space with associated positive definite quadratic 
form Q. Let {ei,...,e n } be an oriented orthonormal, frame for V. We define 
the Chirality operator r to be multiplication in the associated (complexified) 
Clifford algebra by the element 

r = (•\/--l)"' /2 ei • • • e n 

if n is even and by 

r= (V^I)(-+h/2 ei ... en 

if n is odd. Here r G Cl(n) and does not depend on the choice of orthonormal, 
oriented frame. We also have that tv = —vt for v GV and r 2 = 1. 

Let us consider the case of n even. Now we have seen that we can write 
V®C = F ® F where F is totally isotropic and of dimension n. In fact we may 
assume that F has a basis {e 2 j-i — ie 2 j : 1 < j < n/ 2}, where the e,; come 
from an oriented orthonormal basis. Lets use this polarization to once again 
construct the spin representation. 
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First note that Q (or its associated bilinear form) places F and F in duality 
so that we can identify F with the dual space F' . Now set S = AF. First we 
show how V act on S. Given v GV consider v £V®C and decompose v = w + w 
according to our decomposition above. Define (j> w s = \/2w A s and 

(f>u,s = —i(w)s. 

where l is interior multiplication. Now extend <f> linearly to V. Exercise Show 
that <j) extends to a representation of C ® Cl(n). Show that S + = A + F is 
invariant under Co- It turns out that <j> T is (— l) fe on A k F 

20.7.5 Spin Bundles and Spin-c Bundles 

20.7.6 Harmonic Spinors 



Chapter 21 


Complex Manifolds 


21.1 Some complex linear algebra 

The set of all n-tuples of complex C n numbers is a complex vector space and 
by choice of a basis, every complex vector space of finite dimension (over C) 
is linearly isomorphic to C n for some n. Now multiplication by i := y /— 1 is a 
complex linear map C" — > C n and since C" is also a real vector space R 2 " under 
the identification 

(x 1 + iy 1 ,...,x n + iy n )^(x\y\...,x n ,y n ) 

we obtain multiplication by i as a real linear map Jo : R 2 " — » R 2 " given by the 
matrix 

' 0-1 
1 0 

0 -1 
1 0 


Conversely, if V is a real vector space of dimension 2 n and there is a map 
J : V — > V with J 2 = — 1 then we can define the structure of a complex vector 
space on V by defining the scalar multiplication by complex numbers via the 
formula 

(x + iy)v := xv + yJv for v £ V. 

Denote this complex vector space by Vj. Now if e\, ....e n is a basis for Vj (over 
C) then we claim that e\, e n , Jei, Je n is a basis for V over R. We only 
need to show that ei, e n , Jei, Je„ span. For this let v £ V and then 
for some complex numbers c l = a 1 + i6 J we have ^ c * e i = S( aJ + i &) e j = 
a j ej + J2 V Jej. 

Next we consider the complexification of V which is Vc := C®V. Now 
any real basis {fj} of V is also a basis for Vc iff we identify fj with l0/ r 
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Furthermore, the linear map J : V — » V extends to a complex linear map 
J ■ Vc — > Vc and still satisfies J 2 = — 1 . Thus this extension has eigenvalues i 
and — i. Let V 1,0 be the i eigenspace and V 0,1 be the — i eigenspace. Of course 
we must have Vc = V 1,0 © V 0,1 . The reader may check that the set of vectors 
{ei — iJei, e n — i Je n } span V 1,0 while {ei + iJei, e„ + iJe n } span V 0,1 . 
Thus we have a convenient basis for Vc = V 1,0 © V 0-1 . 

Lemma 21.1 There is a natural complex linear isomorphism Vj = V 1,0 given 
by ei ei — i Je*. Furthermore, the conjugation map on Vc interchanges the 
spaces V 1,0 andN 0,1 . 

Let us apply these considerations to the simple case of the complex plane C. 
The realification is R 2 and the map J is 



If we identify the tangent space of R 2n at 0 with R 2 ™ itself then { ^ | Q , gA | 

is basis for R 2 ”. A | complex basis for C n = (R 2n , J 0 ) is, for instance, { | 0 }i<j<n- 

A complex basis for R 2 = C is e\ = gy| Q and so gy | 0 > J gy | 0 is a basis 
for R 2 . This is clear anyway since J g y| 0 = ■ Now the complexifica- 

tion of R 2 is R^. which has basis consisting of e\ — iJei = gy| Q — i g^ and 

ei + iJei=J^| 0 + ig| . These are usually denoted by gy| Q and g|| 0 - More 

generally, we see that if C n is reified to R 2 ™ which is then complexified to 
Rjc" := C © R 2n then a basis for R^." is given by 




d 

d 

d 

5 ’ 

0 

’’ ’ 
u * 0 

dz 1 0 " 



} 


where 

^ d d . d 

dz i 0 ' dx i 0 1 dy i 0 

and 

^ d d . d 

dz 1 0 ' dx l 0 + 1 dy i 0 ' 

Now if we consider the tangent bundle U x R 2n of an open set U C R 2n then 
we have the vector fields gp-, We can complexify the tangent bundle of 
U x R 2 " to get U x Rg" and then following the ideas above we have that the fields 
also span each tangent space T p U := {p} x R^. n . On the other hand, so 
do the fields {gvr, gy^, g|r, gf^}- Now if R 2n had a complex vector space 
structure, say C” = (R 2n , J 0 ), then J 0 defines a bundle map Jo : T p U — > T p U 
given by (p,v) e- > (p, Jfjvj. This can be extended to a complex bundle map 
Jo : TUc = C©TC7 — > TUc = C®TU and we get a bundle decomposition 


TU C = T 10 U © T 01 U 
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where g|r , spans T 10 U at each point and gp-, gp; spans T 01 U. 

Now the symbols gp- etc., already have a meaning a differential operators. 
Let us now show that this view is at least consistent with what we have done 
above. For a smooth complex valued function / : U C C" -4 C we have for 
P = (^i, ....,z n ) G U 

— f= f-i— f\ 

dz l p 2ydx i p J 1 dy l p 1 ) 

1 ( d . d d . . d 

2 ^ dx l p dy l p dx l p dy l p 

1 f du dv \ i f du dv \ 

2 y dx i p + ay 4 p ) + 2 \ p ^ p y ' 

and 


_a^ 

dz l 


f 


1 

2 

1 

2 


1 

2 



/ + i 


_a^ 

ay* 



/ _a^ . _a_ 

Y dx i p U + 1 dy l 


u + 

p 


_a_ 

dx i 


iv + i 

p 


_a_ 

dy i 



( du dv \ 

V dx% p dyi pJ 


1 / du 

2 V dyl p 



Definition 21.1 A function f : U C C n — >■ C is called holomorphic if 

A/^0 (alii) 
on U. A function f is called antiholomorphic if 

(alii). 

Definition 21.2 A map f : U C C n — > C m yiven fry functions /i,...,/ TO is 
called holomorphic (resp. antiholomorphic) if each component function 
is holomorphic (resp. antiholomorphic). 

Now if / : U C C n — > C is holomorphic then by definition ^ 7 | / = 0 for 
all p G 17 and so we have the Cauchy-Riemann equations 

du dv 
dx l dy l 
dv du 

dx l dy l 


(Cauchy-Riemann) 
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and from this we see that for holomorphic / 


df_ 

dz i 

du 

dx i 

= df_ 

dx i 


. dv 
1 dx i 


which means that as derivations on the sheaf O of locally defined holomorphic 
functions on C n , the operators and are equal. This corresponds to the 
complex isomorphism T 10 U = TU, Jq which comes from the isomorphism in 
lemma ??. In fact, if one looks at a function / : R 2n — > C as a differentiable 
map of real manifolds then with J 0 given the isomorphism R 2n = C n , out map 
/ is holomorphic iff 

TfoJ 0 = J 0 oTf 

or in other words 



This last matrix equation is just the Cauchy-Riemann equations again. 


21.2 Complex structure 

Definition 21.3 A manifold M is said to be an almost complex manifold 

if there is a smooth bundle map J : TM — » TM , called an almost complex 
structure , having the property that J 2 = —1. 

Definition 21.4 A complex manifold M is a manifold modelled on C" for 
some n, together with an atlas for M such that the transition functions are all 
holomorphic maps. The charts from this atlas are called holomorphic charts. 
We also use the phrase “ holomorphic coordinates'’ . 

Example 21.1 Let S 2 ( 1/2) = {(aq, X 2 , £ 3 ) € R 3 : x\ + x\ + x 2 = 1/4} be given 
coordinates ip + : (x\,X 2 ,X 3 ) 1 — > (aq + iaj 2 ) f C on U + := {(aq, X 2 , * 3 ) €E 
S 2 : 1 — X 3 yf 0} and %l>~ : (xi,X 2 ,X 3 ) 1 — > 1 _ ) } a , 3 {x\ + 1 x 2 ) G C on U~ := 
{(aq, X 2 , X 3 ) £ S 2 : 1 + 13 / 0}. Then z and w are coordinates on S 2 { 1/2) 
with transition function o ip + ( z ) = I/z. Since on 1 f + U + (~l ip~U ~ the map 
z 1 — > 1/z is a biholomorphism we see that S ' 2 (1/2) can be given the structure of 
a complex 1 -manifold. 

Another way to get the same complex 1-manifold is by taking two copies 
of the complex pane, say C z with coordinate z and C w with coordinate z and 
then identify C 2 with C w — {0} via the map w = \/z. This complex surface 
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is of course topologically a sphere and is also the 1 point compactification of 
the complex plane. As the reader will not doubt already be aware, this complex 
1-manifold is called the Riemann sphere. 

Example 21.2 Let P n { C) be the set of all complex lines through the origin in 
C n+1 which is to say the set of all equivalence classes of nonzero elements of 
C ra+1 under the equivalence relation 

{z\...,z n+1 )~X(z\...,z n+1 ) for X£C 

For each i with 1 < i < n + 1 define the set 

U i :={[z\...,z n+1 ] eP„(C) :2V0} 

and corresponding map ipi : Ui — > C" by 

U[z\...,z n+1 }) = - i (z\...,?,...,z n+1 )€C n . 
z 

One can check that these maps provide a holomorphic atlas for P n ( C) which is 
therefore a complex manifold (complex projective n-space). 

Example 21.3 Let C"‘ be the space ofmxn complex matrices. This is clearly 
a complex manifold since we can always “line up ” the entries to get a map 
C™ — > C mn and so as complex manifolds C™ = C mn . A little less trivially we 
have the complex general linear group GL(n,C) which is an open subset of C” 
and so is an n 2 dimensional complex manifold. 

Example 21.4 (Grassmannian manifold) To describe this important exam- 
ple we start with the set (C)?)* of n x k matrices with rank k < n (maximal 
rank). The columns of each matrix from (C^)* span a k-dimensional subspace 
of C". Define two matrices from (C^)^ to be equivalent if they span the same 
k-dimensional subspace. Thus the set G(k,n) of equivalence classes is in one 
to one correspondence with the set of complex k dimensional subspaces of C n . 
Now let U be the set of all [A] £ G(k , n ) such that A has its first k rows linearly 
independent. This property is independent of the representative A of the equiv- 
alence class [A] and so U is a well defined set. This last fact is easily proven 
by a Gaussian reduction argument. Now every element [A] £ U C G(k , n) is an 
equivalence class that has a unique member Aq of the form 

f Lkxk ) 

1 * )' 

Thus we have a map on U defined by 4' : [A\ i— > Z £ C’(~ k = C k ^ n ~ k K We wish 
to cover G(k,n) with sets U a similar to U and defined similar maps. Let a 
be the shuffle permutation that puts the k columns indexed by i\,...,ik into the 
positions 1 . .... k without changing the relative order of the remaining columns. 
Now consider the set Ui 1 ...i k of all [A] £ G(k,n ) such that any representative 
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A has its k rows indexed by linearly independent. The the permuta- 
tion induces an obvious 1-1 onto map from Ui 1 ...i k onto U = 

We now have maps : Ui 1 ...i k — »■ = C k ( n - k ) given by composition 

^ii...ik ■= ^ ° (7h...i k - These maps form an atlas {d ’ i i _...i k ,Ui 1 .., ik } for G(k,n ) 
which turns out to be a holomorphic atlas (biholomorphic transition maps) and 
so gives G(k , n) the structure of a complex manifold called the Grassmannian 
manifold of complex k-planes in C n . 

Definition 21.5 A complex 1-manifold ( so real dimension is 2) is called a 

Riemann surface. 

If 5 is a subset of a complex manifold M such that near each po £ S there 
exists a holomorphic chart U,ip = (z 1 , ■■■,z n ) such that 0 € S fl U iff z k+1 (p ) = 
• • • = z n (p) = 0 then the coordinates z 1 , ...,z k restricted to U fl S is a chart 
on the set S and the set of all such charts gives S the structure of a complex 
manifold. In this case we call S a complex submanifold of M. 


Definition 21.6 In the same way as we defined differentiability for real man- 
ifolds we define the notion of a holomorphic map (resp. antiholomorphic 
map) from one complex manifold to another. Note however, that we must use 
holomorphic charts for the definition. 

The proof of the following lemma is straightforward. 


Lemma 21.2 Let ip : U 

ip = (z 1 , ..., z n ) and z k = 
by 


C n be a holomorphic chart withp £ U. 


x k +iy k we have that the map J p 


T P M 


Then writing 
T p M defined 


Jp 



_d_ 

dy i 


Jp 



_d_ 

dx i 


p 


is well defined independent of the choice of coordinates. 


The maps J p combine to give a bundle map J : TM — + TM and so an 
almost complex structure on M called the almost complex structure induced by 
the holomorphic atlas. 


Definition 21.7 ^4n almost complex structure J on M is said to be integrable 
if there it has a holomorphic atlas giving the map J as the induced almost 
complex structure. That is if there is an family of admissible charts ip a : U a — >■ 
R 2 " such that after identifying R 2n with C n the charts form a holomorphic atlas 
with J the induced almost complex structure. In this case, we call J a complex 
structure. 
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21.3 Complex Tangent Structures 

Let T p { C) denote the algebra germs of complex valued smooth functions at p 
on a complex n-manifold M thought of as a smooth real 2n— manifold with real 
tangent bundle TM. Let Der p (jF) be the space of derivations this algebra. It is 
not hard to see that this space is isomorphic to the complexified tangent space 
T p Mc = CCiTpM . The (complex) algebra of germs of holomorphic functions at 
a point p in a complex manifold is denoted O p and the set of derivations of this 
algebra denoted Der p (0). We also have the algebra of germs of antiholomorphic 
functions at p which is O p and also Der p (0). 

If ip : U — > C n is a holomorphic chart then writing ip = (z 1 , ..., z n ) and 
z k = x k + i y k we have the differential operators at p £ U: 


d 

dz 1 


d 

dz 1 


(now transferred to the manifold). To be pedantic about it, we now denote the 
coordinates on C" by u;,; = Uj + i Vj and then 


d 

~dz i 


p 


/:= 


d 

~d¥ 


p 




dfoip 


-l 


dw i 

df o ip~ 1 


duo 1 


4>(p) 


4>(p) 


Thought of derivations these span Der p (J 7 ) but we have also seen that they span 
the complexified tangent space at p. In fact, we have the following: 


d 

TpMc = span c —. 


d_ 

d¥ 


= Der P (F) 


TpM 1 ’ 0 = span c <( 


d 

dz 1 


T p M 0,1 = span c 


= {#£ Der p (J 7 ) : vf = 0 for all / € O p } 
d 


dz 1 


= {u G Der p (iF) : vf = 0 for all / € O p } 


and of course 


d 

T p M = span R ~ i 


_d_ 

dy i 


The reader should go back and check that the above statements are consistent 
with our definitions as long as we view the jpx\ p , gfr| p not only as the alge- 
braic objects constructed above but also as derivations. Also, the definitions of 
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T P M 1 ’° and TpM 0 ’ 1 are independent of the holomorphic coordinates since we 
also have 

TpM 1 ’ 0 = ker{ J p : T p M -* T p M} 

21.4 The holomorphic tangent map. 

We leave it to the reader to verify that the construction that we have at each 
tangent space globalize to give natural vector bundles TMc,TM 1,0 and TM 0,1 
(all with M as base space). 

Let M and N be complex manifolds and let / : M — > N be a smooth map. 
The tangent map extend to a map of the complexified bundles Tf : TM , c — > 
TN C . Now TM C = TM 1 ’°®TM 0 ’ 1 and similarly TM C = TN 1 ’°®TN 0,1 . If / is 
holomorphic then T f(T p M 1,0 ) C Tf^N 1,0 . In fact since it is easily verified that 
T /(TpM 1 ’ 0 ) C Tf( p -)N 1,0 equivalent to the Cauchy-Riemann equations being 
satisfied by the local representative on F in any holomorphic chart we obtain 
the following 

Proposition 21.1 T f(T p M 1,0 ) C Tf^N 1,0 if and only if f is a holomorphic 
map. 

The map given by the restriction T p f : TpM 1 ’ 0 — » TfpN 1 ’ 0 is called the 
holomorphic tangent map at p. Of course, these maps concatenate to give a 
bundle map 


21.5 Dual spaces 

Let M,J be a complex manifold. The dual of T p Mc is T*Mq = C ® T*M. 
Now the map J has a dual bundle map J* : T*Mc — ► T*Mc which must also 
satisfy J* o J* = —1 and so we have the at each p £ M the decomposition by 
eigenspaces 

T*M C = T*M 1,0 © T^M 0 ’ 1 
corresponding to the eigenvalues ±i. 

Definition 21.8 The space T*M 1,0 is called the space of holomorphic co-vectors 
at p while T^M 0 ’ 1 is the space of antiholomorphic co-vector at p. 

We now choose a holomorphic chart if : U — > C" at p. Writing if = 
(z 1 , ..., z n ) and z k = x k + i y k we have the 1-forms 

dz k = dx k + i dy k 
and 

dz k = dx k — \dy k . 
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Equivalently, the pointwise definitions are dz k I = dx k I + i dy k I and dz k I = 

x oil | p Ip ^ Ip Ip 

dx k I — i dy k I . Notice that we have the expected relations: 

I « ^ I r» A 


dz k ( —) = (dx k + idr/ fc ) f — — i 1 — ) 

+m y A 2 dxi 2dt } 

= \ s " + \ st i = «* 

dz fc (^) = {dx k + idy k )(--^ + i--^r) 
v y A 2dx* 2<9y ! ' 

= 0 

and similarly 

ds ‘ ( ^) = ^ “ d < > = A 

Let us check the action of J* on these forms: 

J ‘ (d2 ‘ )( ^ ) = J ‘ (dI ‘ + id/)( 5? ) 

= (dx k + \dy k ){jA ) 


dz i 


= i(dx k + idy t )A 




and 


r(^) ( A) = ^pA) 

= -i^(| !) = » = 

Thus we conclude that <iz fc | € TIM 1 ’ 0 . A similar calculation shows that 

Ip P 

dz fc | € TIM 0 ’ 1 and in fact 

Ip P 


T*M 1,0 = span | dz k \ p : k = 1, n| 
T*AI 0,1 = span | dz fc |^ : k = 1, n| 


and {dz 1 , dz"| > dz 1 , cfo"| } is a basis for T*Mc- 


Remark 21.1 If we don’t specify base points then we are talking about fields 
(over some open set) which form a basis for each fiber separately. These are 
called frame fields (e.g. r) or co-frame fields (e.g. dz k ,dz k ). 
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21.6 Examples 

21.7 The holomorphic inverse and implicit func- 
tions theorems. 

Let and (w 1 , be local coordinates on complex manifolds M 

and N respectively. Consider a smooth map f : M —> N. We suppose that 
p £ M is in the domain of (z 1 , z n ) and that q = f(p) is in the domain of the 
coordinates (w 1 , Writing z* = x l + i y l and w l = u l + iv l we have the 

following Jacobian matrices: 


1. In we consider the underlying real structures then we have the Jacobian 

‘ d 
dy i 


given in terms of the frame gf*, A and Jr, Jj- 



d S(p) 

1 &(p) 

i£(p) 

i£(p) •• 



S(p) 

i£(p) 

§£(P) 

i£(p) •• 


Jp(f) = 

d S(p) 

1 £(p) 

d S(p) 

#(p) •• 



SJ(p) 

f$p) 

S(p) 

g(p) •• 


With respect to the bases J , 

, Jr and 

d d 

dw % ’ dw l 

we have 



J P ,c(f) 


Jll J 12 

J 12 J22 


where the J t j are blocks of the form 


dw z dw z 
dzj. dzj. 
dw z dw l 
dzi dzi 


If / is holomorphic then these block reduce to the form 

dw z 
dzi 

0 

It is convenient to put the frame fields in the order g|r, g|^, g|t^;..., g|^ 
and similarly for the -JC , . , . In this case we have for holomorphic f 

J OW 7 OW z 1 J 

' J 1 - 0 0 ' 

0 J 1 ’ 0 


0 

dw z 

dzi 


Jp.c(f) 
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where 


We shall call a basis arising from a holomorphic coordinate system “sepa- 
rated” when arranged this way. Note that J 1,0 is just the Jacobian of the 
holomorphic tangent map T 1,0 / : T 1,0 M — > T 1,0 7V with respect to this 

the holomorphic frame . 

We can now formulate the following version of the inverse mapping theorem: 

Theorem 21.1 (1) Let U and V be open set in C" and suppose that the map 
f : U — > V is holomorphic with J 1,0 (/) nonsingular at p £ U. Then there 
exists an open set Uq C U containing p such that f\ Ug : Uq — > f(Uo) is a 1-1 
holomorphic map with holomorphic inverse. That is, f\ Ug is biholomorphic. 

(2) Similarly, if f : U V is holomorphic map between open sets of complex 
manifolds M and N then ifT k ’°f : Tp’°M — > Tj^ N is a linear isomorphism 
then f is a biholomorphic map when restricted to a possibly smaller open set 
containing p. 

We also have a holomorphic version of the implicit mapping theorem. 

Theorem 21.2 (1) Let f : U C C" — > V C C fc and let the component functions 
of f be fi,...,fk ■ If Jp'°(f) has rank k then there are holomorphic functions 
g 1 , g 2 , ..., g k defined near 0 £ C n ~ k such that 

f(z\...,z n )=p 

z° = g J (z k+1 ,...,z n ) for j = 1 ,..,k 

(2) If f : M — > N is a holomorphic map of complex manifolds and if for 
fixed q £ N we have that each p £ f~ 1 (p) is regular in the sense that T^' 0 f : 
Tp ,0 M — » TjpN is surjective, then S := f~ 1 (p) is a complex submanifold of 
(complex) dimension n — k. 

Example 21.5 The map p : C" +1 — > C given by (z 1 , ... ,z n+1 ) t— > (z 1 ) 2 + • • • + 
(. z n+ 1 ) 2 has Jacobian at any (z 1 , ..., z n+1 ) given by 

[ 20 1 2z 2 ••• 2 z n+1 ] 

which has rank 1 as long as (z 1 , ..., z n+1 ) ^ 0. Thus ^^(l) is a complex 
submanifold ofC n+1 having (complex) dimension n. Warning: This is not the 
same as the sphere given by \z x \ +■ ■ ■+ |z n+1 | = 1 which is a real submanifold 
of C" +1 = K 2n+2 of real dimension 2n + 1. 




dw l 

~dzi 

dw l 


dzl 
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Chapter 22 

Classical Mechanics 


Every body continues in its state of rest or uniform motion in a straight line, except 

insofar as it doesn’t. 

Arthur Eddington, Sir 


22.1 Particle motion and Lagrangian Systems 

If we consider a single particle of mass m then Newton’s law is 

d 2 x _. . . . 

m-jjj = F(x(t),t) 

The force F is conservative if it doesn’t depend on time and there is a potential 
function V : R 3 — > R such that F(x) = — gradF(x). Assume this is the case. 
Then Newton’s law becomes 

d 2 

+ grad V (x(f)) = 0. 

Consider the Affine space C(/,x i,X 2 ) of all C 2 paths from xi to X 2 in K 3 
defined on the interval I = [ti,t 2 ]. This is an Affine space modelled on the 
Banach space Cq(I) of all C r functions e : I — > K 3 with e(ti) = e{t\) = 0 and 
with the norm 

Ikll =sup{|e(t)| + |£ , (t)| + |£ ,, (t)|}. 

tei 

If we define the fixed affine linear path a : I — > K 3 by a(t) = xj + (X2 — Xi) 

then all we have a coordinatization of (7(7, xi, X 2 ) by Cq(I) given by the single 
chart ^ : c n c - a e d 0 2 (I). Then the tangent space to C(/, x i,x 2 ) at a fixed 
path Co is just Cq(I). Now we have the function S defined on C(/, x i,x 2 ) by 

S( c ) = £ Qw||c'(t )|| 2 - F(c {t))jdt 
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The variation of S is just the 1-form SS : Cq(I) — > R. defined by 

d 


SS-e = 


dr 


S(C 0 + T£) 


T— 0 


Let us suppose that 6S = 0 at c 0 . Then we have 
d 


0 = 


dr 

_d_ 

dr 


S(c' 0 (t) + T£) 


T—0 


T— 0 J 1 1 


\ m W c 'o(t) + T£\t)f -V(c 0 (t)+Te(t)) ) dt 


C 2 / 3V dr i \ 

J t [m(c' 0 (t),e\t))-Q x .(c 0 ) — (0))dt 

Jt ' 0 (t) ■ ^ £ (t) - gradV(c 0 (t)) ■ e(t)^j dt 


rt-2 


(mcg(t) — grady(c 0 (<))) • e(t)dt 


Now since this is true for every choice of e gCq(I) we see that 

mc'g(t) - grad V(c 0 (t.)) = 0 

thus we see that c 0 (t) = x(t) is a critical point in C(l,x i,x 2 ), that is, a sta- 
tionary path, iff ?? is satisfied. 


22.1.1 Basic Variational Formalism for a Lagrangian 

In general we consider a differentiable manifold Q as our state space and then 
a Lagrangian density function L is given on TQ. For example we can take a 
potential function V : Q — > R, a Riemannian metric g on Q and define the 
action functional S on the space of smooth paths I — » Q beginning and ending 
at a fixed points pi and P 2 given by 


5(c) = f L (c'(t))dt = 

Jt i 


-to (c'(t), c'(t)) - V(c(t))dt 


The tangent space at a fixed Co is the Banach space TI(cqTQ) of C 2 vector 
fields £ : I — > TQ along Co which vanish at t\ and £ 2 - A curve with tangent e 
at cq is just a variation v : (— e, e) x I — > Q such that e(t)= ^| S=0 W ( S > t) is the 
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variation vector field. Then we have 


SS-e = y 
ds 


5=0 
t2 Q 


ds 


jO(M 


T , dv , . 


s=0 


= etc. 


dt 

dt 


Let us examine this in the case of Q = U C R". With q = (g 1 , ...g”) being 
(general curvilinear) coordinates on U we have natural ( tangent bundle chart) 
coordinates q, q on TU = U x R™ . Assume that the variation has the form 
q(s, t) = q(f) + se(t). Then we have 


6S ■ £ 


6 > 

ds s 

ft 2 g 

t , ds 


ot2 


=0 Jtx 


L(q(s,t)A(sA))dt 
L(q + S£, q + si)dt 


s=0 


f t2 ,d L. dL, .. .. , 

= /. < a5 <q ' q) ' £ + a5 (q ' q )' E)<il 

[ t2 ,d L. .. d dL ms 
= 1 <a,< q ' q >' £ -Sa4 (q ' q) ' £)<i< 
f t2 dL dSL, ... , 

-L 


and since e was arbitrary we get the Euler-Lagrange equations for the motion 


dL 

dq 


(q, q) 


d dL 
dt dq 


(q, q) = o 


In general, a time-independent Lagrangian on a manifold Q (usually rep- 
resenting the position space of a particle system) is a smooth function on the 
tangent bundle (velocity space): 


L : TQ -► Q 


and the associated action functional is a map from the space of smooth curves 
C'°°([a, b],Q) defined for c : [a, b] — > Q by 

Sl{c) = f L (c(t))dt 
J a 

where c : [a, 6] — * TQ is the canonical lift (velocity). A time dependent La- 
grangian is a smooth map 

L : R x TQ -> Q 

where the first factor R is the time t, and once again we have the associated 
action functional Sl(c) = f ^ L(t, c{t))dt. 

Let us limit ourselves initially to the time independent case. 
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Definition 22.1 A smooth variation of a curve c : [a,b\ — > Q is a smooth map 
v : [a, b] x (— e, e) — > Q for small e such that v(t, 0) = c(t). We call the variation 
a variation with fixed endpoints if v(a, s ) = c(a) and v(b, s ) = c(b) for all 
s € (— e,e). Now we have a family of curves v s = The infinitesimal 

variation at is the vector field along c defined by V(t) = 4^(f, 0). This V 
is called the variation vector field for the variation. The differential of the 
functional 6 Sl (classically called the first variation) is defined as 


SS L {c ) • V 


d_ 

ds 

d 

ds 


S-l{v s ) 


s=0 


L {v s {t))dt 


s=0 J a 


Remark 22.1 Every smooth vector field along c is the variational vector field 
coming from some variation of c and for any other variation v' with V ( t ) = 
qfr(t, 0) the about computed quantity SSl(c) ■ V will be the same. 


At any rate, if 5Sl{c) ■ V = 0 for all variations vector fields V along c and 
vanishing at the endpoints then we write 6Sl(c) = 0 and call c critical (or 
stationary) for L. 

Now consider the case where the image of c lies in some coordinate chart 
U, if = q 1 , q 2 , ...q n and denote by TU,Tip = (q 1 , q 2 , q n , q 1 , q 2 , q n ) the nat- 
ural chart on TU C TQ. In other words, Tf>(t;) = (q 1 o r(£), q 2 o r(£), ..., q n o 
r (£)> dq 1 ^), dq 2 (f ), ..., dq n (£)). Thus the curve has coordinates 

(c,c) = (q 1 (t),q 2 (t),...,q n (t),q 1 (t),q 2 (t),... i q n (t)) 


where now the q l (t) really are time derivatives. In this local situation we can 
choose our variation to have the form q l (t) + s5q l (t) for some functions 5q l (t) 
vanishing at a and b and some parameter s with respect to which we will differ- 
entiate. The lifted variation is (q(£) + sd(t), q(f) + sd>q(f)) which is the obvious 
abbreviation for a path in Tif>(TU) C R" x R". Now we have seen above that 
the path c will be critical if 


d_ 

ds 


L(q(f) + s6(t), q(f) + sSq(t)))dt = 0 


s=o • 


for all such variations and the above calculations lead to the result that 


d_ 

<9q 


L(q(f),q(f)) 


d L d_ 

dt dq 


L(q(f),q(t)) = 0 


Euler-Lagrange 


for any L-critical path (with image in this chart). Here n = dim(<5). 

It can be show that even in the case that the image of c does not lie in the 
domain of a chart that c is L-critical path if it can be subdivided into sub-paths 
lying in charts and L-critical in each such chart. 
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22.1.2 Two examples of a Lagrangian 

Example 22.1 Suppose that we have a 1-form 9 £ X*(Q). A 1-form is just a 
map 8 : TQ — » R that happens to be linear on each fiber T p Q. Thus we may 
examine the special case ofL = 9. In canonical coordinates (q,q) again, 


L = 9 = y^ j a i (q)dq l 


for some functions ai(q). 
equations become 


An easy calculation shows that the Euler-Lagrange 



dau 

dq l 


q l = 0 


but on the other hand 


d9 = 


1 \ ^ / dctj 

2 ^ Qqi 


tO 8 '^ 


and one can conclude that if c = ( q l (t )) is critical for L = 9 then for any vector 
field X defined on the image of c we have 

or d9(c(t), X) = 0. This can be written succinctly as 


Cc(t)d9 0. 


Example 22.2 Now let us take the case of a Riemannian manifold M, g and 
let L(ti) = ig(i>,ii). Thus the action functional is the “energy” 

S g {c) = J g(c(t),c(t))dt 

In this case the critical paths are just geodesics. 


22.2 Symmetry, Conservation and Noether’s The- 
orem 

Let G be a Lie group acting on a smooth manifold M. 

A : G x M -> M 

As usual we write g ■ x for A {g,x). We have a fundamental vector field if 
associated to every £ € g defined by the rule 

£*(p)=T(e lP) A- CO) 
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or equivalently by the rule 

£{p)=j t ex P (tO'P 

The map t| : £ i— > ^ is a Lie algebra anti-homomorphism. Of course, here we are 
using the flow associated to £ 

FI 5 (f,p) := FI t\t,p) = exp (tfi) ■ p 

and it should be noted that t 1 — > exp(t£) is the one parameter subgroup associ- 
ated to f and to get the corresponding left invariant vector field £ X L (G) 
we act on the right: 

x Hg) = ^ . 9 -exp(f£) 

Now a diffeomorphism acts on a covariant k-tensor field contravariantly accord- 
ing to 

{4>*K){p){v i, ...v k ) = K(cj)(p))(T<j)Vi, ...T<jyu k ) 

Suppose that we are given a covariant tensor held T £ T (M) on M. We think 
of T as defining some kind of extra structure on M. The two main examples 
for our purposes are 

1. T = (.,.) a nondegenerate covariant symmetric 2-tensor. Then M, (., .) is 
a (semi-) Riemannian manifold. 

2. T = to £ 0 2 (M) a non-degenerate 2-form. Then M, u> is a symplectic 
manifold. 

Then G acts on T since G acts on M as diffeomorphisms. We denote this 
natural (left) action by g ■ T. If g ■ T = T for all g £ G we say that G acts by 
symmetries of the pair M, T. 

Definition 22.2 In general, a vector field X on M, T is called an infinitesi- 
mal symmetry of the pair M, T if C\L = 0. Other terminology is that X is a 
T — Killing field . The usual notion of a Killing field in (pseudo-) Riemannian 
geometry is the case when T = (, ) is the metric tensor. 

Example 22.3 A group G is called a symmetry group of a symplectic manifold 
M, u> if G acts by symplectomorphisms so that g • lo = u> for all g £ G. In this 
case, each £ £ g is an infinitesimal symmetry of M , lo meaning that 

C^u) = 0 

where C j is by definition the same as Cc \ . This follows because if we let g t = 
exp (tfi) then each gt is a symmetry so g)co = 0 and 

C^lo = ^ g* t u = 0 
at 0 
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22.2.1 Lagrangians with symmetries. 

We need two definitions 


Definition 22.3 If <j> : M — > M is a diffeomorphism then the induced tangent 
map T(j> : TM — » TM is called the canonical lift. 


Definition 22.4 Given a vector field X G X(M) there is a lifting of X to 
X G X(TM) = r(TM, TTM) 


X : 

TM — s 

■ TTM 


1 

i 

X : 

M — > 

TM 


such that the flow F1 Y is the canonical lift of F1 A ' 


Flf : 

TM - 

-> TM 

Flf : 

1 

l 

M -> 

M 


In other words, F1 A = TF1 A . We simply define X(v) = ^(TF1 A • v ). 


Definition 22.5 Let ojl denote the unique 1-form on Q which in canonical co- 
ordinates is wl = £" =1 

Theorem 22.1 (E. Noether) If X is an infinitesimal symmetry of the La- 
grangian then the function u>i,(X) is constant along any path c : I Cl that is 
stationary for the action associated to L. 


Let’s prove this using local coordinates {(ft , eft) for TU a C TQ. ft turn out 
that locally, 


* = E 


iJL 

dq l 


E 


da 1 d 
dqJ deft 


where a 1 is defined by X = £ a l (q) ftft, . Also, lol(X) = £a*|^-. Now suppose 
that q z (t),q l (t ) = \q l {t ) satisfies the Euler-Lagrange equations. Then 


-w L (X)(<,‘((), «'(*)) 


dt 

= E j t 


deft 

9L 

’dft 


dt deft 


= E 




dq l 


dq 1 


= dL{X) = C x L = 0 
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This theorem tells us one case when we get a conservation law. A conserva- 
tion law is a function C on TQ (or T*Q for the Hamiltonian flow) such that C 
is constant along solution paths, (i.e. stationary for the action or satisfying the 
E-L eqns.) 

L : TQ -> Q 

let X G T(TQ). 


22.2.2 Lie Groups and Left Invariants Lagrangians 

Recall that G act on itself by left translation l g : G — > G. The action lifts to 
the tangent bundle Tl g : TG — » TG. Suppose that L : TG — > K. is invariant 
under this left action so that L{Tl g Xh) = L(X p ) for all g,h G G. In particular, 
L (Tl g X e ) = L{X e ) so L is completely determined by its restriction to T e G = g. 
Define the restricted Lagrangian function by A = L\ T G . We view the differential 
dA as a map dA : g — > ffi. and so in fact dX G g*. Next, recall that for any (Gg 
the map ad^ : g — > g is defined by ad^n = [£, v\ and we have the adjoint map 
adc : g* — > g* . Now let t i— > g(t) be a motion of the system and define the 
“body velocity” by v c {t) = T7 c (t)-i • c' (t) = UG(c'(t)). Then we have 

Theorem 22.2 Assume L is invariant as above. The curve c(.) satisfies the 
Euler-Lagrange equations for L iff 

^dA(v c {t)) = ad* c(i) dA 


22.3 The Hamiltonian Formalism 


Let us now examine the change to a description in cotangent chart q, p so that 
for a covector at q given by a(q)-dq has coordinates q, a. Our method of transfer 
to the cotangent side is via the Legendre transformation induced by L. In fact, 
this is just the fiber derivative defined above. We must assume that the map 
F : (q, q) i— > (q, p) = (q,|4(q, q)) is a diffeomorphism (this is written with 
respect to the two natural charts on TU and T*U ). Again this just means 
that the Lagrangian is nondegenerate. Now if v(t) = (q(£),q(t)) is (a lift of) a 
solution curve then defining the Hamiltonian function 

~ dL 

H(q.q) = ^v(q,q)-q-L(q,q) 


we compute with q =Jyq 


d 

dt 


H{ q, q) 


S<| O.4)-<0-S L ( q ’« 

dL . .. d . d dL , d 

m iq - q, di q+ ^ai iq - q>q -dt Llq ’ q> 


= 0 
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we have used that the Euler-Lagrange equations (q, q) — ^ (q, q) = 0. Thus 

differential form dH = ^dq+^dq is zero on the velocity v'(t) = ^(q, q) 

dH ■ v'(t) = dH ■ ^(q,q) 

dH dq dH dq 

= — H = 0 

<9q dt dq dt 

We then use the inverse of this diffeomorphism to transfer the Hamiltonian 
function to a function H( q, p) = F~ 1 *H( q, q) = p • q(q, p) — L(q, q(q, p)).. 
Now if q(f),q(t) is a solution curve then its image 6(f) = Fov(t) = (q(t),p(t)) 
satisfies 


dH(b'(t)) = (dH ■ TF.v'(t )) 
= (F*dH) ■ v'(t) 
= d(F*H) ■ v'(t) 
= dH ■ v'(t) = 0 


so we have that 


but also 


0 = dH(b'(t )) 


dH dq dH dp 

dq dt dp dt 


d_ 

dp 


H( q,p) = q+p- 


dq 

dp 


dL dq . dq 

dq dp ^ dt 


solving these last two equations simultaneously we arrive at Hamilton’s equa- 
tions of motion: 


d , . dH , , . ... 

S q(t)= af (q(t) ’ p( ‘» 

| p( i) = -|K ( q ( t ) .p ( ( )) 


or 






d P 


Remark 22.2 One can calculate directly that yw(q(t), p(t)) = 0 for solutions 
these equations. If the Lagrangian was originally given by L = — V for 

some kinetic energy function and a potential energy function then this amounts 
to conservation of energy. We will see that this follows from a general principle 
below. 
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Chapter 23 

Symplectic Geometry 


Equations are more important to me, because politics is for the present, 
but an equation is something for eternity 
-Einstein 


23.1 Symplectic Linear Algebra 

A (real) symplectic vector space is a pair V,a where V is a (real) vector 
space and a is a nondegenerate alternating (skew-symmetric) bilinear form a, : 
V x V — > R. The basic example is R 2n with 

«o (x,y) = x t J n y 

where 

t ( 0 t n xn \ 

" ~ \ -Iuxn 0 ) ■ 

The standard symplectic form on a o is typical. It is a standard fact from linear 
algebra that for any N dimensional symplectic vector space V, a there is a 
basis e%, e n , f 1 1 /” called a symplectic basis such that the matrix that 
represents a with respect to this basis is the matrix J n . Thus we may write 

ol = e 1 A fi + ... + e n A /„ 

where e 1 , e n , /i, ..., /„ is the dual basis to ei, e„, f 1 , f n . If V,r] is a 
vector space with a not necessarily nondegenerate alternating form 77 then we 
can define the null space 

N v = {u € V : r](v, w) = 0 for all w £ V}. 

On the quotient space V = V /N v we may define rj(v, w) = rj(v , w) where v and 
w represent the elements v, W G V. Then V, fj is a symplectic vector space called 
the symplectic reduction of V, 77. 
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Proposition 23.1 For any 77 G /\V* (regarded as a bilinear form) there is 
linearly independent set of elements e 1 , ..., e k , /i , fk from V* such that 

?7 = e 1 A /i + ... + e k A f k 


where dim(V) — 2 k > 0 is the dimension of N v . 

Definition 23.1 Note: The number k is called the rank of ip The matrix that 
represents 77 actually has rank 2k and so some might call k the half rank of ip 

Proof. Consider the symplectic reduction V, fj of V,ry and choose set of 
elements e 1 , ..., e k , /i, ..., fk such that e 1 , ..., e k , fi, ..., fk form a symplectic ba- 
sis of V, rp Add to this set a basis a basis for N v and verify that 

e 1 , e k , fi, ..., fk,bi, ..., bi must be a basis for V. Taking the dual basis one can 
check that 

?7 = e 1 A fi + ... + e k A fk 

by testing on the basis e 1 , ..., e k , /i, ..., f k , £>i , ..., bp m 

Now if W is a subspace of a symplectic vector space then we may define 

W ± = {» £ V : T](v, w) = 0 for all w G W} 

and it is true that dim(W) + dim(W ± ) = dim(V) but it is not necessarily 
the case that W n = 0 . In fact, we classify subspaces W by two numbers: 
d = dim(W) and v = dim(W fl W 2 -). If v = 0 then ?7| w , W is a symplectic 
space and so we call W a symplectic subspace . At the opposite extreme, if 
v = d then W is called a Lagrangian subspace . If W C W 2 - we say that W 
is an isotropic subspace. 

A linear transformation between symplectic vector spaces £ : Vi, 771 — > V2 , 172 
is called a symplectic linear map if r]2(£(v), £(w)) = iji(v, w) for all v, w € Vi; 
In other words, if £*r]2 = r/i ■ The set of all symplectic linear isomorphisms from 
V, 77 to itself is called the symplectic group and denoted Sp(V,r/). With 
respect to a symplectic basis B a symplectic linear isomorphism £ is represented 
by a matrix A = [f]g that satisfies 

A l J A= J 

where J = J n is the matrix defined above and where 2 n = dim(V). Such a 
matrix is called a symplectic matrix and the group of all such is called the 
symplectic matrix group and denoted Sp(n,M.). Of course if dim(V) = 2 n 
then Sp(V,rf) = Sp(n, R) the isomorphism depending a choice of basis. If 77 is 
a symplectic from on V with dim(V) = 2 n then rf 1 G A 2 "V is nonzero and so 
orients the vector space V. 

Lemma 23.1 If A G Spin. M) then det(A) = 1. 

Proof. If we use A as a linear transformation R 2n — > R 2n then A*ao = Oo 
and A* a ft = aft where ao is the standard symplectic form on R 2n and aft G 
A 2n R 2n is top form. Thus clet A = 1 . m 
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Theorem 23.1 (Symplectic eigenvalue theorem) If X is a (complex) eigen- 
value of a symplectic matrix A then so is 1/A, A and 1/A. 

Proof. Let p( A) = det(T — XI) be the characteristic polynomial. It is easy 
to see that J* = — J and J TJ _1 = (T^ 1 ) 4 . Using these facts we have 

p( A) = det(J(T - XI) J- 1 ) = det(T _1 - XI) 

= clet (A- 1 (I - AT)) = det(7 - AT) 

= A 2 " det(\l - T)) = A 2 >(l/A). 

A 

So we have p( A) = A 2 "p(l/A). Using this and remembering that 0 is not an 
eigenvalue one concludes that 1/A and A are eigenvalues of T. ■ 

Exercise 23.1 With respect to the last theorem, show that X and 1/A have the 
same multiplicity. 

23.2 Canonical Form (Linear case) 

Suppose one has a vector space W with dual W* . We denote the pairing between 
W and W* by (.,.). There is a simple way to form a symplectic form on the 
space Z = W x W* which we will call the canonical symplectic form. This 
is defined by 

U((m,ai), (v 2 ,a 2 )) := (a 2 ,Vi) - (a\,v 2 ). 

If W is an inner product space with inner product (., .) then we may form the 
canonical symplectic from on Z = W x W by the same formula. As a special 
case we get the standard symplectic form on R 2n = R" x R" given by 

tt{{x,y), (x,y))=y-x-y-x. 


23.3 Symplectic manifolds 

Definition 23.2 T symplectic form on a manifold M is a nondegenerate 
closed 2- form to £ Q 2 (M) = T(M,T*M). A symplectic manifold is a pair 
(M, u>) where lo is a symplectic form on M . If there exists a symplectic form on 
M we say that M has a symplectic structure or admits a symplectic structure. 

A map of symplectic manifolds, say / : (M, uf) — > (TV, vo) is called a sym- 
plectic map iff f*w = u>. We will reserve the term symplectomorphism to 
refer to diffeomorphisms that are symplectic maps. Notice that since a symplec- 
tic form such as uj is nondegenerate, the 2 n form u" = u > A • • • Aw is nonzero and 
global. Hence a symplectic manifold is orientable (more precisely, it is oriented) . 

Definition 23.3 The form 0^ = ^Ar/)! w n is called the canonical volume 
form or Liouville volume. 
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We immediately have that if / : (M, u>) — » (M, w) is a symplectic diffeomor- 
phism then f* 

Not every manifold admits a symplectic structure. Of course if M does 
admit a symplectic structure then it must have even dimension but there are 
other more subtle obstructions. For example, the fact that if 2 (S' 4 ) = 0 can 
be used to show that S' 4 does not admit ant symplectic structure. To see this, 
suppose to the contrary that u is a closed nondegenerate 2- form on S 4 . The 
since H 2 (S 4 ) = 0 there would be a 1-form 6 with dd = to. But then since 
d(co A 9) = cv A w> the 4-form u A u> would be exact also and Stokes’ theorem 
would give f g4 co A u> = f s4 d(u> A 9) = / as4=0 u> A 8 = 0. But as we have seen 
u 2 = u> A u> is a nonzero top form so we must really have f g4 wAw / 0. So 
in fact, S 4 does not admit a symplectic structure. We will give a more careful 
examination to the question of obstructions to symplectic structures but let us 
now list some positive examples. 


Example 23.1 (surfaces) Any orientable surface with volume form (area form) 
qualifies since in this case the volume to itself is a closed nondegenerate two form. 

Example 23.2 (standard) The form iw can = dx l A dx l+n on R 2 " is the 
prototypical symplectic form for the theory and makes R" a symplectic manifold. 
(See Darboux’s theorem 23.2 below) 

Example 23.3 (cotangent bundle) We will see in detail below that the cotan- 
gent bundle of any smooth manifold has a natural symplectic structure. The 
symplectic form in a natural bundle chart ( q,p ) has the form u> = dq l Adpi. 

(warning: some authors use — "_i dq l A dpi = dp: A dq l instead). 

Example 23.4 (complex submanifolds) The symplectic R 2 ” may be consid- 
ered the realification of C n and then multiplication by i is thought of as a map 
J : R 2 " —4 R 2 ". We have that u> can (v, Jv) = — \v\ 2 so that u> can is nondegen- 
erate on any complex submanifold M of R 2ra and so M, w can | M is a symplectic 
manifold. 


Example 23.5 (coadjoint orbit) Let G be a Lie group. Define the coadjoint 
map Acl^ : G — > GL(g*) which takes g to Acl^ by 

Ad I(£)(A) =£(Ad g -i (x)). 

The action defined by Acl^ 

g -> g ' ? = Adg ($) 

is called the coadjoint action. Then we have an induced map acl^ : g — > gl(g*) 
at the Lie algebra level; 

adt (aO(0 (y) = -£([*, y])- 


The orbits of the action given by Ad* are called coadjoint orbits and we will 
show in theorem below that each orbit is a symplectic manifold in a natural way. 
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23.4 Complex Structure and Kahler Manifolds 


Recall that a complex manifold is a manifold modelled on C" and such that the 
chart overlap functions are all biholomorphic. Every (real) tangent space T p M 
of a complex manifold M has a complex structure J p : T p M — » T p M given in 
biholomorphic coordinates 2 = x + iy by 


j ( 

J P\ 


j ( 

J P\ a,.i 


d_ 

dx l 

d_ 

dy i 


d_ 

dy i j 
d 


and for any (biholomorphic) overlap function A = ip o ip 1 we have TA o J = 
JoTA. 


Definition 23.4 .4 n almost complex structure on a smooth manifold M is 
a bundle map J : TM — > TM covering the identity map such that J 2 = —id. 
If one can choose an atlas for M such that all the coordinate change functions 
(overlap functions) A satisfy TA o J = J o TA then J is called a complex 
structure on M . 


Definition 23.5 An almost symplectic structure on a manifold M is a 
nondegenerate smooth 2-form oj which is not necessarily closed. 

Theorem 23.2 A smooth manifold M admits an almost complex structure 
if and only if it admits an almost symplectic structure. 

Proof. First suppose that M has an almost complex structure J and let g 
be any Riemannian metric on M. Define a quadratic form q p on each tangent 
space by 

q P (v) = g P {v,v) + g p (Jv, Jv). 

Then we have q p (Jv) = q p (v). Now let h be the metric obtained from the 
quadratic form q by polarization. It follows that h(u, w) = h (Jv,Jw) for all 
v. w £ TM. Now define a two form oj by 

cj( v, w) = h(u, Jw). 

This really is skew-symmetric since u>(v, w) = h(i>, Jw) = h (Jv, J 2 w) = — h (Jv, w) 
to(w, v). Also, w is nondegenerate since if v ^ 0 then u>(v, Jv) = h(i>, v ) > 0. 

Conversely, let w be a nondegenerate two form on a manifold M. Once 
again choose a Riemannian metric g for M. There must be a vector bundle 
map il : TM TM such that 

u(v,w) = g(f 2v,w) for all v,w £ TM. 

Since u> is nondegenerate the map O must be invertible. Furthermore, since O 
is clearly anti-symmetric with respect to g the map — S2 o O = — f l 2 must be 
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symmetric and positive definite. From linear algebra applied fiberwise we know 
that there must be a positive symmetric square root for — fi 2 . Denote this by 
P = \/ — O 2 . Finite dimensional spectral theory also tell us that PCI = OP. 
Now let J = OP^ 1 and notice that 

J 2 = (fLP^XfhP- 1 ) = 0 2 P- 2 = -n 2 n~ 2 = - id . 


One consequence of this result is that there must be characteristic class 
obstructions to the existence of a symplectic structure on a manifolds. In fact, 
if M, u> is a symplectic manifold then it is certainly almost symplectic and so 
there is an almost complex structure J on M. The tangent bundle is then a 
complex vector bundle with J giving the action of multiplication by y/—l on 
each fiber T p M. Denote the resulting complex vector bundle by TM J and then 
consider the total Chern class 

c(TM J ) = c n (TM J ) + ... + ci (TM J ) + 1. 

Here Ci{TM J ) G H 2l (M,7i). Recall that with the orientation given by w" the 
top class c n (TM J ) is the Euler class e(TM) of TM. Now for the real bundle 
TM we have the total Pontrijagin class 

p(TM) = p n (TM) + ... + Pl (TM) + 1 

which are related to the Chern classes by the Whitney sum 

p(TM) = c(TM J ) © c(TM~ J ) 

= (. c n (TM J ) + ... + a (TM J ) + 1)((-1 ) n c n (TM J ) - +... + Cl {TM J ) + 1) 

where TM~ J is the complex bundle with — J giving the multiplication by \J — 1. 
We have used the fact that 

Cl (TM- J ) = (— 1 y Ci (TM J ). 

Now the classes pk{TM) are invariants of the diffeomorphism class of M an so 
can be considered constant over all possible choices of J. In fact, from the above 
relations one can deduce a quadratic relation that must be satisfied: 

Pk (TM) = c k {TM J f - 2 c k _ 1 (TM J )c k+1 (TM J ) + • • • + (-l) fe 2c 2fc (TM J ). 

Now this places a restriction on what manifolds might have almost complex 
structures and hence a restriction on having an almost symplectic structure. Of 
course some manifolds might have an almost symplectic structure but still have 
no symplectic structure. 

Definition 23.6 A positive definite real bilinear form h on an almost complex 
manifold M , J is will be called Hermitian metric or J -metric if h is J invariant. 

In this case h is the real part of a Hermitian form on the complex vector bundle 
TM, J given by 

( v , w) = h(v, w) + ih(Jv, w) 
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Definition 23.7 A diffeomorphism <j> : M, J, h — > M, J, /i is called a Hermitian 
isometry iffT(f>oJ = JoT<j) and 

h{T<pv,T(jyw) = h(v,w). 

A group action p : G x M — > M is called a Hermitian action if p{g , .) is 
a Hermitian isometry for all g. In this case, we have for every p £ M a the 
representation dp p : H p — » Aut (T p M, J p ) of the isotropy subgroup H p given by 

dp P (g)v = Tpp g ■ v. 

Definition 23.8 Let M, J be a complex manifold and u> a symplectic structure 
on M. The manifold is called a Kahler manifold if h(v,w) := u>(v,Jw) is 
positive definite. 

Equivalently we can define a Kahler manifold as a complex manifold 
M, J with Hermitian metric h with the property that the nondegenerate 2-form 
u>(v,w) := h(v,Jw) is closed. 

Thus we have the following for a Kahler manifold: 

1. A complex structure J , 

2. A J-invariant positive definite bilinear form b , 

3. A Hermitian form (v, w) = h(v , w) + ih(Jv, w). 

4. A symplectic form u> with the property that u>(v,w) = h(v, Jw). 

Of course if M, J is a complex manifold with Hermitian metric h then 
u(v,w) := h(v,Jw) automatically gives a nondegenerate 2-form; the question 
is whether it is closed or not. Mumford’s criterion is useful for this purpose: 

Theorem 23.3 (Mumford) Let p : GxM — > M be a smooth Lie group action 
by Hermitian isometries. Forp £ M let H p be the isometry subgroup of the point 
p. If J p £ dp p (H p ) for every p then we have that w defined by cv(v, w) := h(v, Jw) 
is closed. 

Proof, ft is easy to see that since p preserves both h and J it also preserves 
u> and du. Thus for any given p £ M, we have 

dhj(dp p (g)u,dp p (g)v,dp p (g)w) = du(u,v,w) 

for all g £ H p and all u, v, w £ T p M. By assumption there is a g p £ H p with 
J p = dp p (g p ). Thus with this choice the previous equation applied twice gives 

duj(u, v , w ) = duj(J p u, J p v, J p w ) 

= du>(J p u, J p v , J^w) 

= div(—u, —v, —w) = —dujfu, v, w) 

so duj = 0 at p which was an arbitrary point so dui = 0. ■ 

There is also Riemannian condition for the closedness of to. Since a Kahler 
manifold is a posteriori a Riemannian manifold it has associated with it the 
Levi-Civita connection V. In the following we view J as an element of 
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Theorem 23.4 For a Kahler manifold M, J, h with associated symplectic form 
u> we have that 

dco = 0 iff VJ = 0. 

23.5 Symplectic musical isomorphisms 

Since a symplectic form to on a manifold M is nondegenerate we have a map 

co b : TM — > T*M 

given by cj\,(X p )(v p ) = to(X p ,v p ) and the inverse cu^is such that 




or 

oo(iJ(a p ),v p ) = a p (v p ) 

Let check that co$ really is the inverse (one could easily be off by a sign in this 
business): 

Wb(w ti (ap))(up) = u)(u} t (a p ),v p ) = a p (v p ) for all v p 

=> = a P- 

Notice that co^ induces a map on sections also denoted by to & with inverse co\, : 
X(M) X*(M). 

Notation 23.1 Let us abbreviate cu“(a) to %ol and u>\,(v) to bv. 

23.6 Darboux’s Theorem 

Lemma 23.2 (Darboux’s theorem) On a 2 n- manifold ( M , u>) with a closed 
2-form co with co n ^ 0 (for instance if (M,co) is symplectic) there exists a sub- 
atlas consisting of charts called symplectic charts (a.k.a. canonical coordinates) 
characterized by the property that the expression for to in such a chart, is 

n 

cou = ^2 dx l A dx l+n 

i - 1 

and so in particular M must have even dimension 2 n. 

Remark 23.1 Let us agree that the canonical coordinates can be written (x i , yf) 
instead of(x l ,x l+n ) when convenient. 

Remark 23.2 It should be noticed that if x % ,yi is a symplectic chart then )J dx l 
must be such that 

H r. 

Y / dx r Ady r Wx\g-)=6) 

r= 1 
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but also 

n 3 n ( 3 3 \ 

J2dx r A dy r {Ux\ = E ( dx r {Ux)dy r {^-) - dy r (W)dx r (^-) J 

r—1 r—1 ' ' 

= -dy 3 {\ \dx l ) 

and so we conclude that j \dx l = — and similarly \dy l — q x , . 

Proof. We will use induction and follow closely the presentation in [?]. 
Assume the theorem is true for symplectic manifolds of dimension 2 (n — 1). 
Let p £ M. Choose a function y 1 on some open neighborhood of p such that 
dyi(p) ^ 0. Let X = §dyi and then X will not vanish at p. We can then choose 
another function x 1 such that Xx 1 = 1 and we let Y = —$dx l . Now since 
duj = 0 we can use Cartan’s formula to get 

£ X tO = CyUO = 0 . 

Next contract to with the bracket of X and Y (using the notation (X, to) = Lxto 
, see notation 10.1): 

(\X,Y],u) = (Lx Y,to) = C x (Y,to) - (Y,£ x lo) 

= Cxi-dx 1 ) = —d{X(x 1 )) = -dl = 0. 

Now since to is nondegenerate this implies that [A, Y] = 0 and so there must be 
a local coordinate system (x 1 ,yi,w 1 , ...,w 2n ~ 2 ) with 

oy i 

A = x 

dx 1 

In particular, the theorem is true if n = 1. Assume the theorem is true for 
symplectic manifolds of dimension 2(n — 1). If we let to' = to — dx 1 A dyi then 
since dto 1 = 0 and hence 

(X,u>') = C x u' = {Y,to') = C Y co' = 0 

we conclude that to' can be expressed as a 2-form in the w 1 , ..., w 2n ~ 2 variables 
alone. Furthermore, 


0 yf to n = (to — dx 1 A dyi) n 
= ±ndx l A dyi A ( to') n 

from which it follows that to' is the pullback of a form nondegenerate form vo on 
R 2n-2 _ Xo be exact if we let the coordinate chart given by (x 1 , yi, w 1 , ..., w 2n ~ 2 ) 
by denoted by % '> and let pr be the projection R 2n = K 2 x K 2 ™ -1 — > K 2 ™" 1 then 
to' = (pr o Thus the induction hypothesis says that t o' has the form 
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uj' = idx 1 ^ dy* f° r some functions x l ,yi with i = 2 It is easy to 
see that the construction implies that in some neighborhood of p the full set of 
functions x l ,yi with i = 1, ...,n form the desired symplectic chart. ■ 

An atlas A of symplectic charts is called a symplectic atlas. A chart ( U , (p ) 
is called compatible with the symplectic atlas A if for every (4> a ,U a ) € A we 
have 

= U 0 

for the canonical symplectic u> ca n = Yli = l A du l+n defined on if a (U D U a ) C 

R 2 ” using standard rectangular coordinates u 1 . 


23.7 Poisson Brackets and Hamiltonian vector 
fields 

Definition 23.9 (on forms) The Poisson bracket of two 1-forms is defined 
to be 

{a,/3}± = T KtM/?] 

where the musical symbols refer to the maps oA and u>\,. This puts a Lie algebra 
structure on the space of 1-forms Q 1 (M) = X*(M). 

Definition 23.10 (on functions) The Poisson bracket of two smooth func- 
tions is defined to be 


{f,g}± = ±w(Mf, Ug) = ±uj(x f ,x g ) 

This puts a Lie algebra structure on the space P(M) of smooth function on 
the symplectic M. It is easily seen (using dg = ix g u) that {/, g}± = ±Lx g f = 
pLx f g which shows that / i— > {.f,g} is a derivation for fixed g. The connection 
between the two Poisson brackets is 

d{f,g}± = {df , dg}±. 

Let us take canonical coordinates so that u> = Ya= i dx l /\dyi. If X p = dx l (X) + 
E?=i dyi(X and v p = dx l (v p ) + dyi(v p ) then using the Einstein sum- 
mation convention we have 

u\>(X)(v p ) 

= u{dx\X)-^ i +dy i {X)-^,dx\v p )-^ i +dy i {v p )-^-) 

= ( dx\X)dyi - dyi(X)dx l )(v p ) 


so we have 

Lemma 23.3 u)\,(X p ) = J2i=idx'(X)dyi - dyi{X)dx l = YAi=i{~dyi{X)dx l + 
dx l (X)dy l ) 
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Corollary 23.1 Ifa = J™= i a (^) dxi +Y^=i a (wJ dyi then ^( a ) = E"=i a (^)gfr- 

An now for the local formula: 

Corollary 23.2 {f,g} = ~ § &) 

Proof, df = §^dx i + §^dyi and dg = Js ' T dx j + so ft# = - 

and similarly for dg. Thus (using the summation convention again); 

{f,g} = u($df,$dg) 

df d df d dg d dgd 

W dy, dx l dx l dy^ ’ dyi dxi dxl dyi 

_ df_dg_ _ df_dg_ 
dx l dyi dyi dx l 

■ 

A main point about Poison Brackets is 

Theorem 23.5 / is constant along the orbits of X g iff {/, g} = 0. In fact, 

|yoFlf'=0^ {/,<?} = 0 ^£f oFlf-=0 

Proof. ±g oFlf' = (Flf ')*£*,</ = (Flf ')*{/> $}• Also use {f,g} = 

—{9i /}• ■ 

The equations of motion for a Hamiltonian H are 

|/ o Flfu = ±{/ o FI f H ,H} ± = / o Flf *}± 

which is true by the following simple computation 

j t f oFlf« = |(Flf-)*/ = (Fl t Y «)*L XH / 

= W/oFlf«) = {/oFlf",ff} ± . 

Notation 23.2 From now on we will use only {. unless otherwise indicated 
and shall write {., .} for {., .}+. 

Definition 23.11 A Hamiltonian system is a triple (. M,u),H ) where M is a 
smooth manifold, c o is a symplectic form and H is a smooth function H : M —> 

R. 

The main example, at least from the point of view of mechanics, is the 
cotangent bundle of a manifold which is discussed below. From a mechanical 
point of view the Hamiltonian function controls the dynamics and so is special. 

Let us return to the general case of a symplectic manifold M, to 
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Definition 23.12 Now if H : M — > R is smooth then we define the Hamilto- 
nian vector field Xh with energy function H to be ui^dH so that by definition 
l Xh U) = dH. 

Definition 23.13 A vector field X on M,u> is called a locally Hamiltonian 
vector field or a symplectic vector field iff Lxoj = 0 . 

If a symplectic vector field is complete then we have that (Fl^)*u; is defined 
for all f £ R. Otherwise, for any relatively compact open set U the restriction 
Flf to U is well defined for all t < b(U) for some number depending only on U. 
Thus (Fl Y ")*w is defined on U for t < b(U). Since U can be chosen to contain 
any point of interest and since M can be covered by relatively compact sets, it 
will be of little harm to write (Fl Y ')*w even in the case that X is not complete. 

Lemma 23.4 The following are equivalent: 

1. X is symplectic vector field, i.e. Lxoj = 0 

2. lxcj is closed 

3. (Flf)*w = w 

4- X is locally a Hamiltonian vector field. 

Proof. (1)+=+ (4) by the Poincare lemma. Next, notice that Lxu) = 
d o t X (V + Lx ° du = d o ix<jJ so we have (2)+=+(l). The implication (2)+=>(3) 
follows from Theorem 7.8. ■ 

Proposition 23.2 We have the following easily deduced facts concerning Hamil- 
tonian vector fields: 

1. The H is constant along integral curves of Xh 

2. The flow of Xh is a local symplectomorphism. That is F1^ H *oj = u 

Notation 23.3 Denote the set of all Hamiltonian vector fields on M , u> by TL(u>) 
and the set of all symplectic vector fields by SV(u>) 

Proposition 23.3 The set SV(cv) is a Lie subalgebra ofX(M). In fact, we have 
[SV{u),SP{u)\ C H(u) C X(M). 

Proof. Let X,Y £ SV{u>). Then 

[X,Y]ju> = C x Y_iuj = C x {Yau) - YjLxoj 
= d{X_Y_>u) + X_id(T_,w) - 0 

= d(x_iy_iw) + o + o 

= —d(u(X,Y)) = —X w (x,y)-M 

and since w in nondegenerate we have [X, Y] = X_ u rx,Y) G Hiio). ■ 
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23.8 Configuration space and Phase space 

Consider the cotangent bundle of a manifold Q with projection map 

n:T*Q-*Q 

and define the canonical 1-form 6 £ T* ( T*Q ) by 

9 :v olp '-^ a p {Tn ■ v ap ) 

where a p £ T*Q and v ap £ T ap [T* Q) . In local coordinates this reads 

9o = ^2 Pi d q ’• 

Then lot-q = —dO is a symplectic form which in natural coordinates reads 

wt*q = ^2 A dPi 

Lemma 23.5 9 is the unique 1-form, such that for any f3 € 0 1 (Q) we have 

13*9 = (3 

where we view (3 as (3 : Q — > T* Q . 

Proof: (3*6(v q ) = 9\^ q) {T/3 ■ v q ) = f3(q)(Tn o Tf3 ■ v q ) = j3(q)[v q ) since 
Tir o T(3 = T(n o (3) = T(id) = id. 

The cotangent lift T* f of a diffeomorphism / : Q\ — > Q 2 is defined by the 
commutative diagram 

T*Q 1 tl T*Q 2 

I I 

Qi Q 2 

and is a symplectic map; i.e. (T*/)* lo 0 = w 0 - In fact, we even have (T* /)* 9 0 = 

e 0 . 

The triple (T*Q,u>t*q, H) is a Hamiltonian system for any choice of smooth 
function. The most common form for H in this case is \K + V where K is 
a Riemannian metric which is constructed using the mass distribution of the 
bodies modelled by the system and V is a smooth potential function which, 
in a conservative system, depends only on q when viewed in natural cotangent 
bundle coordinates q l ,Pi- 

Now we have j \dg = ^ and introducing the ± notation one 

more time we have 


{f,9}± 


±u T *Q(UfAdg) = ±df($dg) = 

dpi dq l dq l dpi 
± , df dg _ df dg 
dq l dpi dpi dq l 


dg d 
dq l dpi 
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Thus letting 




the equations of motions read 


J t fm,p{i)) = |/oFlf- = {/oFlf",R} 

_ df dH df dH 

dq l dpi dpi dq l 

Where we have abbreviated / o Fl^ Yff to just /. In particular, if / = q l and 
/ = Pi then 


q\t) 


h{t) 


DH 
dpi 
dH 
~ dq l 


which should be familiar. 


23.9 Transfer of symplectic structure to the Tan- 
gent bundle 

Case I: a (pseudo) Riemannian manifold 

If Q,g is a (pseudo) Riemannian manifold then we have a map g b :TQ—+T*Q 
defined by 

g b 0)O) = g(v,w) 

and using this we can define a symplectic form vjq on TQ by 

= (g b ) w 

(Note that dm 0 = d(g b *uj) = g b *dui = 0.) In fact, m 0 is exact since u> is exact: 

m 0 = (g b ) w 

= (g b Y d8 = d(g i ’*0) . 

Let us write 0 q = g b *0. Locally we have 


0 o (x, v)(vi,v 2 ) = g x (v, «i) or 
©o = 'Y^gijcfdq 3 
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and also 

zuo(x,v)((v 1 ,v 2 ), ((w 1 ,w 2 ))) 

= gx(w 2 ,v i) - gx(v 2 ,W!) + D x g x (v,v l) • Wi - D x g x (v,Wi) ■ V-L 
which in classical notation (and for finite dimensions) looks like 

h = gijdq 1 A dq 3 + ^ ff^t^dq 3 A ^ 


Case II: Transfer of symplectic structure by a Lagrangian function. 

Definition 23.14 Let L : TQ — > Q be a Lagrangian on a manifold Q. We say 
that L is regular or non- degenerate at £ £ TQ if in any canonical coordinate 
system (q,q) whose domain contains the matrix 


' d 2 L 
dq l dqi 


(9(0, m ) 


is non-degenerate. L is called regular or nondegenerate if it is regidar at all 
points in TQ. 


We will need the following general concept: 


Definition 23.15 Let tte ■ E — > M and ttf '■ F —> M be two vector 
A map L : E — » F is called a fiber preserving map if the following 
commutes 


E 


F 


bundles. 

diagram 


7T£ \ / ■ 

M 


We do not require that the map L be linear on the fibers and so in general L is 
not a vector bundle morphism. 

Definition 23.16 If L : E — » F is a fiber preserving map then if we denote the 
restriction of L to a fiber E p by L p define the fiber derivative 

FL : E — > Horn (E, F) 

by FL : e p e-> Df\ p (e p ) for e p £ E p . 

In our application of this concept, we take F to be the trivial bundle Q x K. 
over Q so Hom(Ti,F) = Hoinfff, R) = T*Q. 

Lemma 23.6 A Lagrangian function L : TQ K gives rise to a fiber deriva- 
tive FL : TQ — > T*Q. The Lagrangian is nondegenerate iffFL is a diffeomor- 
phism. 
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Definition 23.17 The form wl is defined by 

wl = (FL)* u> 


Lemma 23.7 u>l is a symplectic form on TQ iff L is nondegenerate (i.e. if 
F L is a diffeomorphism) . 


Observe that we can also define d L = (FL)* 6 so that d0 L = d(FL)* 0 = 
(F L)* dd = (FL)* to = wl so we see that u>l is exact (and hence closed a 
required for a symplectic form). 

Now in natural coordinates we have 


wl 


d 2 L 

dffdqi 


dq 1 A dqi + 


d 2 L 

dq l dqi 


dq 1 A dq J 


as can be verified using direct calculation. 

The following connection between the transferred forms wl and wo and 
occasionally not pointed out in some texts. 


Theorem 23.6 Let V be a smooth function on a Riemannian manifold M, h. 
If we define a Lagrangian by L — — V then the Legendre transformation 

FL :: TQ — > T*Q is just the map g b and hence wl = Wh- 

Proof. We work locally. Then the Legendre transformation is given by 

q i ^q i 

But since L( q, q) = |g(q,q) - V(q) we have = -^\gkiq l q k = guq 1 which 
together with q l \-^q l is the coordinate expression for g b : 

q^giiq 1 


23.10 Coadjoint Orbits 

Let G be a Lie group and consider Acl^ : G —> GL(g*) and the corresponding 
coadjoint action as in example 23.5. For every £ £ g* we have a Left invariant 
1-form on G defined by 

9^ = f o ojq 

where u>g is the canonical g- valued 1-form (the Maurer Cartan form). Let the 
be the isotropy subgroup of G for a point (£ g* under the coadjoint action. 
Then it is standard that orbit G ■ f is canonically diffeomorphic to the orbit 
space G/G £ and the map </>{ : g <— > g • £ is a submersion onto . Then we have 
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Theorem 23.7 There is a unique symplectic form on G/G % = G ■ f such 
that (f>^I 2 ^ = dd^ . 

Proof: If such a form as f if exists as stated then we must have 

Q^(T(j>£.v,T(f)£.w) = dO^(v,w) for all v,w € T g G 

We will show that this in fact defines fd as a symplectic form on the orbit G - f. 
First of all notice that by the structure equations for the Maurer Cartan form 
we have for v, w £ T e G = 0 

d0*{v,w) = f(daj G {v,w)) = £(w G ([u,te])) 

= £(“[«,«>]) = ad f (¥)(£)(>) 

From this we see that 

ad f (t;)(0 = 0 <=> v £ Null(d0*| e ) 

where Null(d0^| e ) = {v G 0 : dffi \ e ( v , w) for all w £ 0 }. On the other hand, 
G^ = ker{g 1 — » AclJ(^)} so ad^(u)(£) = 0 iff v £ T e Gc = 0 ^. 

Now notice that since dd^ is left invariant we have that Null(d0^| ) = 
TL g (g^) which is the tangent space to the coset gG j which is also ker T<j >£ | . 
Thus we conclude that 


Null(d0^| ) = ker T<j >% | g . 

It follows that we have a natural isomorphism 

T g< (G ■ 0 = T^\ g (T g G) « T g G/(TLM) 

Another view: Let the vector field on G ■ f corresponding to v, w £ 0 
generated by the action be denoted by id and ltd . Then we have fi^(£)(id, ud) := 
£(— [i>, w]) at £ £ G ■ £ and then extend to the rest of the points of the orbit by 
equi variance: 

&(g ■ 0( v ^ w ^) = Ad(£(— [u,ic])) 

9 


23.11 The Rigid Body 

In what follows we will describe the rigid body rotating about one of its points in 
three different versions. The basic idea is that we can represent the configuration 
space as a subset of R. 3N with a very natural kinetic energy function. But this 
space is also isomorphic to the rotation group SO( 3) and we can transfer the 
kinetic energy metric over to SO( 3) and then the evolution of the system is given 
by geodesics in SO(3) with respect to this metric. Next we take advantage of 
the fact that the tangent bundle of SO( 3) is trivial to transfer the setup over 
to a trivial bundle. But there are two natural ways to do this and we explore 
the relation between the two. 
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23.11.1 The configuration in M 3iV 

Let us consider a rigid body to consist of a set of point masses located in R 3 
at points with position vectors ri(£), ...rjv(t) at time t. Thus r* = (xi, x 2 , xf) 
is the coordinates of the i - th point mass. Let mi, ..., denote the masses 
of the particles. To say that this set of point masses is rigid is to say that 
the distances |r, — r 7 j are constant for each choice of i and j. Let us assume 
for simplicity that the body is in a state of uniform rectilinear motion so that 
by re-choosing our coordinate axes if necessary we can assume that the there 
is one of the point masses at the origin of our coordinate system at all times. 
Now the set of all possible configurations is some submanifold of R. 3N which 
we denote by M. Let us also assume that at least 3 of the masses, say those 
located at ri,r 2 ,r 2 are situated so that the position vectors ri,r 2 ,r 2 form a 
basis of M 3 . For convenience let r and r be abbreviations for (r-[ (t ), ..., rjv(f)) and 
(ri (£),..., rjv(£)).The correct kinetic energy for the system of particles forming 
the rigid body is \K{ r, r) where the kinetic energy metric K is 

A'(v, w) = TOiVi • Wi + • • • + m.N'VN ■ w n- 

Since there are no other forces on the body other than those that constrain the 
body to be rigid the Lagrangian for M is just \K{ r, r) and the evolution of the 
point in M representing the body is a geodesic when we use as Hamiltonian K 
and the symplectic form pulled over from T* M as described previously. 

23.11.2 Modelling the rigid body on 50 ( 3 ) 

Let rj(0), ...rOjjy denote the initial positions of our point masses. Under these 
condition there is a unique matrix valued function g(t) with values in SO{ 3) such 
that r i(t) = <jf(f)rj(0). Thus the motion of the body is determined by the curve 
in 50(3) given by 1 1 — > g(t). In fact, we can map SO( 3) to the set of all possible 
configurations of the points making up the body in a 1-1 manner by letting 
r i(0) = £ 1 , ••■r(0) JV = and mapping <F : g (g£ 1 , ...,g£ N ) € M C R 3N . If we 
use the map 1> to transfer this over to TSO( 3) we get 

k(£, v ) = K(T<& ■ C Ti> ■ v) 

for TSO( 3). Now k is a Riemannian metric on 50(3) and in fact, k is a 

left invariant metric: 

k(£,v) = k(TL g £,TL g v) for all £ TSO(3). 

Exercise 23.2 Show that k really is left invariant. Hint: Consider the map 
V go : (vi, • • • , v N ) e-> (p 0 vi, • • • ,5 0 vn) for g 0 eSO(3) and notice that /j J() o$ = 
$ o L go and hence Tg go o Td> = o TL go . 

Now by construction, the Riemannian manifolds M,K and 50(3), k are 
isometric. Thus the corresponding path g(t) in 50(3) is a geodesic with respect 
to the left invariant metric k. Our Hamiltonian system is now (T50(3), flk, k) 
where is the Legendre transformation of the canonical symplectic form fi on 
T*SO{ 3) 
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23.11.3 The trivial bundle picture 

Recall that we the Lie algebra of £0(3) is the vector space of skew-symmetric 
matrices so(3). We have the two trivializations of the tangent bundle TSO( 3) 
given by 


tri y L (v g ) = (g,u G (v g )) = (g,g 1 v g ) 
trxv R (v g ) = (g,uj G (v g )) = 

with inverse maps £0(3) x so(3) — * TSO{ 3) given by 

(. 9 ) B) i ► TL g B 
(g,B) i ^ TR g B 

Now we should be able to represent the system in the trivial bundle £0(3) x 
so(3) via the map trivz,(u s ) = ( g,u)c{v g )) = ( g,g~ 1 v a ). Thus we let ko be the 
metric on £0(3) x so(3) coming from the metric k. Thus by definition 

ko((ff, v), (g, to)) = k (TL g v,TL g w) = k e (v, w) 
where v,w £ so (3) are skew-symmetric matrices. 


23.12 The momentum map and Hamiltonian ac- 
tions 


Remark 23.3 In this section all Lie groups will be assumed to be connected. 


Suppose that ( a connected Lie group) G acts on M,w as a group of sym- 
plectomorphisms . 


cr : G x M —> M 


Then we say that a is a symplectic G-action . Since G acts on M we have for 
every v £ q the fundamental vector field X v = v a . The fundamental vector field 
will be symplectic (locally Hamiltonian) . Thus every one-parameter group g f of 
G induces a symplectic vector field on M. Actually, it is only the infinitesimal 
action that matters at first so we define 


Definition 23.18 Let M be a smooth manifold and let g be the Lie algebra of 
a connected Lie group G. A linear map a' : v i— > X v from g into X(M) is called 

a g-action if 


[. X V ,X W } = -x^ v ’ w ^ or 
[a\v),a'{w)\ = -a'{[v,w\). 


If M,u> is symplectic and the g-action is such that Cx^lo = 0 for all v £ g we 
say that the action is a symplectic g-action. 
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Definition 23.19 Every symplectic action a : G x M — * M induces a g-action 
da via 


where X v (x ) 


da : v i— > X v 


d_ 

dt 


a(exp(tv), x). 
o 


In some cases, we may be able to show that for all v the symplectic field 
X v is a full fledged Hamiltonian vector field. In this case associated to each 
there is a Hamiltonian function J v = .Jx« with corresponding Hamiltonian 
vector field equal to X v and J v is determined up to a constant by X v = $dJx v ■ 
Now i X vU is always closed since dix^ui = Cx^oj. When is it possible to define 
J v for every v £ g ? 


Lemma 23.8 Given a symplectic g-action a' : v i— > X u as above, there is a 
linear map v i— > J v such that X v = §dJ v for every v £ g iff lx^oj is exact for 
all v £ g. 


Proof. If H v = Hx^ exists for all v then dJx™ = oj(X v , .) = for all v 
so LxvU) is exact for all v £ g. Conversely, if for every v £ g there is a smooth 
function h v with dh v = lx^oj then X v = $dh v so h v is Hamiltonian for X v . 
Now let vi,...,v n be a basis for g and define J Vi = h Vi and extend linearly. ■ 
Notice that the property that v i— > J v is linear means that we can define a 
map J : M —> g* by 

J{x){v) = J v (x) 

and this is called a momentum map . 


Definition 23.20 A symplectic G-action a (resp. g-action a’ ) on M such that 
for every «£g the vector field X v is a Hamiltonian vector field on M is called 
a Hamiltonian G-action (resp. Hamiltonian g-action ). 


We can thus associate to every Hamiltonian action at least one momentum 
map-this being unique up to an additive constant. 


Example 23.6 If G acts on a manifold Q by diffeomorphisms then G lifts to 
an action on the cotangent bundle T*M which is automatically symplectic. In 
fact, because ojq = dO o is exact the action is also a Hamiltonian action. The 
Hamiltonian function associated to an element v £ g is given by 

Mx) = e 0 



Definition 23.21 If G (resp. g) acts on M in a symplectic manner as above 
such that the action is Hamiltonian and such that we may choose a momentum 
map J such that 


J[v,w] { d v , J u! } 


where J v (x ) = J(x)(v) then we say that the action is a strongly Hamiltonian 
G-action (resp. g-action). 
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Example 23.7 The action of example 23.6 is strongly Hamiltonian. 


We would like to have a way to measure of whether a Hamiltonian action is 
strong or not. Essentially we are just going to be using the difference J\ VtW \ — 
{J v , J w } but it will be convenient to introduce another view which we postpone 
until the next section where we study “Poisson structures”. 

PUT IN THEOREM ABOUT MOMENTUM CONSERVATION!!!! 

What is a momentum map in the cotangent case? Pick a fixed point a £ T*Q 
and consider the map <I> Q : G — * T*Q given by $ a {g) = g • a. = g~ 1 *a. Now 
consider the pullback of the canonical 1-form <f>*0o. 


Lemma 23.9 The restriction 4>*6 , o| £l is an element of g* and the map a i— > 
$*0 O | 0 is the momentum map. 

Proof. We must show that <I>* 0q| 0 (v) = H v (a ) for all v £ g. Does 
<P*Jol g (v) live in the right place? Let g* = exp(t'f) . Then 


(T e $ a v) 


d 
dt 
d I 


$ a (exp(ut)) 

o 

(exp(— vt))*a 


d_ 

dt 


exp (vt) ■ a 
o 


We have 


*:%(») = »oi, < t a>») 




exp (vt) • a) = J v (a) 


Definition 23.22 Let G act on a symplectic manifold M,u> and suppose that 
the action is Hamiltonian. A momentum map J for the action is said to be 
equivariant with respect to the coadjoint action if J(g ■ x) = Ad*-i J(x). 
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Chapter 24 

Poisson Geometry 


Life is good for only two things, discovering mathematics and teaching 
mathematics 

-Simeon Poisson 


24.1 Poisson Manifolds 

In this chapter we generalize our study of symplectic geometry by approaching 
things from the side of a Poisson bracket. 

Definition 24.1 A Poisson structure on an associative algebra A is a Lie 
algebra structure with bracket denoted by {., .} such for a fixed a £ A that the 
map x i— > {a, x} is a derivation of the algebra. An associative algebra with 
a Poisson structure is called a Poisson algebra and the bracket is called a 

Poisson bracket . 

We have already seen an example of a Poisson structure on the algebra 5 ( M ) 
of smooth functions on a symplectic manifold. Namely, 

{f,g} = uj(u t df,w i dg). 

By the Darboux theorem we know that we can choose local coordinates (q 1 , ...,q n ,pi, ...,p„) 
on a neighborhood of any given point in the manifold. Recall also that in such 
coordinates we have 


df = ^2 

i = 1 


df_d_ 
dpi dq i 


y- df_d_ 

dq‘ dpi 


sometimes called the symplectic gradient. It follows that 

y '^, df dg _ df dg 
^ dq l dpt dpi dq l 

i—1 
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Definition 24.2 A smooth manifold with a Poisson structure on is algebra of 
smooth functions is called a Poisson manifold. 

So every symplectic n-manifold gives rise to a Poisson structure. On the 
other hand, there are Poisson manifolds that are not so by virtue of being a 
symplectic manifold. 

Now if our manifold is finite dimensional then every derivation of $(M) is 
given by a vector field and since g i— > {/, g} is a derivation there is a correspond- 
ing vector field Xf. Since the bracket is determined by these vector field and 
since vector fields can be defined locally ( recall the presheaf Xm) we see that 
a Poisson structure is also a locally defined structure. In fact, U e- ► $m(U) is a 
presheaf of Poisson algebras. 

Now if we consider the map w : 5 m — ► jEm defined by {f,g} = w(f) ■ g we 
see that {f,g} = w(f) ■ g = —w(g) ■ f and so {/, g}(p) depends only on the 
differentials df,dg of / and g. Thus we have a tensor B(., .) £ T /\ 2 TM such 
that B(df,dg) = {f,g}. In other words, B p (.,.) is a symmetric bilinear map 
T*M xT*M — » R. Now any such tensor gives a bundle map : T*M i— > 
T**M = TM by the rule (a)(0) = B(0,a) for f3,a £ T*M and any p £ M. 
In other words, B(0, a) = 0(B^(a)) for all (3 £ T*M and arbitrary p £ M. The 
2- vector B is called the Poisson tensor for the given Poisson structure. B is 
also sometimes called a co-symplectic structure for reasons that we will now 
explain. 

If M. w is a symplectic manifold then the map oj\, : TM — > T*M can be 
inverted to give a map : T*M — > TM and then a form W £ /\ 2 TM defined 
by (a) (0) = W (0, a) (here again /?, a must be in the same fiber) . Now this 
form can be used to define a Poisson bracket by setting {/, g} = W(df, dg) and 
so W is the corresponding Poisson tensor. But notice that 

{.A g} = W(df,dg) = J(dg)(df) = df(J(dg)) 

= w(oj^df, w^dg) 

which is just the original Poisson bracket defined in the symplectic manifold 
M, LU. 

Given a Poisson manifold M, {., .} we can always define {.,.}_ by {/, g}~ = 
{g, /}. Since we some times refer to a Poisson manifold M, {., .} by referring 
just to the space we will denote M with the opposite Poisson structure by M~ . 

A Poisson map is map <f> : M, N, {.,.}2 is a smooth map such that 

r {/,<?} = {Pf,Fg} for all f,g£ 5(M). 

For any subset S' of a Poisson manifold let So be the set of functions from 
5(M) which vanish on S. A submanifold S of a Poisson manifold M, {., .} is 
called coisotropic if So closed under the Poisson bracket. A Poisson manifold 
is called symplectic if the Poisson tensor B is non-degenerate since in this case 
we can use B ** to define a symplectic form on M. A Poisson manifold admits 
a (singular) foliation such that the leaves are symplectic. By a theorem of A. 
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Weinstien we can locally in a neighborhood of a point p find a coordinate system 
(q\Pi, w l ) centered at p and such that 

d d 1 i A d d 

~ ^ dq l dpi + 2 ^ a ^ dw i dwi 
*= i hi 

where the smooth functions depend only on the w’s. vanish at p. Here k is the 
dimension of the leave through p. The rank of the map B ** on T* M is k. 

Now to give a typical example let g be a Lie algebra with bracket [., .] and g* 
its dual. Choose a basis ei, e n of g and the corresponding dual basis e 1 , ..., e n 
for g*. With respect to the basis ei, ..., e n we have 

[o, ej] = Yl C ij ek 

where Cj'„ are the structure constants. 

For any functions f,g£ S'(g*) we have that df a ,dg a are linear maps g* — > K. 
where we identify T a g* with g*. This means that df a , dg a can be considered to 
be in g by the identification g** = g. Now define the ± Poisson structure on g* 

by 

{f,g}±{ai) = ±a([df a ,dg a ]) 

Now the basis ei, ...,e n is a coordinate system y on g* by yi(a) = a(ei). 


Proposition 24.1 In terms of this coordinate system the Poisson bracket just 
defined is 


{f,g}± 


± 5Z b b 

i= 1 


df_dg_ 
dyi dyj 


where Bij = J2 C ijVk ■ 


Proof. We suppress the ± and compute: 


{f,g} 


i**i = E 


dg 




df dg 


ijVk 


Vb d l d l- 

h l3d vi d vi 
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Chapter 25 


Quantization 


25.1 Operators on a Hilbert Space 

A bounded linear map between complex Hilbert spaces A : TL\ — + H 2 is said 
to be of finite rank iff the image A( Hi) C H 2 is finite dimensional. Let us 
denote the set of all such finite rank maps by J r (7f 1 ,7Y 2 )- If Hi = H 2 we write 
T{H). The set of bounded linear maps is denoted B(Hi,H 2 ) and is a Banach 
space with norm given by ||A|| b ^ Hi H ^ := sup{||At>|| : ||u|| = 1}. Let B(H) be 
the algebra of bounded operators which is then a Banach space with the norm 
given by ||A||g^ := sup{||Av|| : ||i>|| = 1}. The reader will recall that B(H) is 
in fact a Banach algebra meaning that in we have 

\\m 

13(H) — W A Wb(H) W B Wb(H) ■ 

We will abbreviate || A\\ B ^ to just || A|| when convenient. Recall that the adjoint 
of an element A G B(H) is that unique element A* defined by (A*v, w) = ( v , Aw) 
for all v, w G H. One can prove that in fact we have 

U*A\\ = || A|| 2 

One can define on B{H) two other important topologies. Namely, the strong 
topology is given by the family of seminorms A 1 — > || Av\\ for various v G H and 
the weak topology given by the family of seminorms A 1 — > ( Av , w) for various 
v. w G H. Because of this we will use phrases like “norm-closed”, “strongly 
closed” or “weakly closed” etc. 

Recall that for any set W C H we define W 1 - := {v : (v,w) = 0 for all 
w G IT}. The following fundamental lemma is straightforward to prove. 

Lemma 25.1 For any A G B(H) we have 

img(A*) = ker(A) _L 
ker(A*) = img(A) J_ . 
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Lemma 25.2 T(H) has the following properties: 

1) F £ T{H) => AoF £ T(H) and F o A £ F(H) for all A £ 13(H). 

2) F £ T(TL) => 5* e JF(X). 

TTiws F(H) is a two sided ideal of 3(H) closed under *. 

Proof. Obvious. ■ 


Definition 25.1 An linear map A £ B(H\,Hf) is called compact if A(5(0, 1)) 
is relatively compact (has compact closure) in H 2 - The set of all compact oper- 
ators Hi — » H- 2 . is denoted 1C(H 1 ,H2 ) or if Hi = H -2 by IC(H). 

Lemma 25.3 TfH^HT) is dense in JC(H 1 ,H 2 ) and K.(H 1 ,H 2 ) is closed in 

B(Hi,H 2 ). 

Proof. Let A £ KAfH^Hf)- For every small e > 0 there is a finite subset 
{y-\ ,y 2 , !Jk} C 5(0,1) such that the e— balls with centers yi, j/ 2 , Vk cover 
A(B(0, 1)). Now let P be the projection onto the span of {Ayi, Ay 2 , .., Ayk} ■ 
We leave it to the reader to show that PA £ ^(H^Hf) and || PA — A|| < e. It 
follows that tF(H l ,H 2 ) is dense in IC(H 1 ,H 2 )- 

Let {A n }„>o be a sequence in K,(fH 1 ,H 2 ) which converges to some A £ 
3(Hi,H2). We want to show that A £ JC(H 1 ,H 2 ). Let e > 0 be given and 
choose no > 0 so that \\A n — A\\ < e. Now A„ 0 is compact so A„ o (5(0, 1)) is 
relatively compact. It follows that there is some finite subset N C 5(0, 1) such 
that 

A„ 0 (5(M))C U B ( a ’ e )' 

a£N 

Then for any y £ 5(0, 1) there is an element a £ N such that 
\\A(y) - A(a)\\ 

< P(l/) “ A no (y ) || + ||A no (y) — A no (o)|| + ||A no (a) - A(a)| 

< 3e 


and it follows that A(5(0, 1)) is relatively compact. Thus A £ 1C(H 1 ,H2)- ■ 

Corollary 25.1 JC(H) is a two sided ideal in 3(H). 

We now give two examples (of types) of compact operators each of which is 
prototypical in its own way. 

Example 25.1 For every continuous function K (., .) £ L 2 ([ 0, 1] x [0, 1]) gives 
rise to a compact integral operator K on 5 2 ([0, 1]) defined by 

(A K f)(x) := f K(x,y)f(y)dy. 

J [0,1] x [0,1] 
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Example 25.2 If TL is separable and we fix a Hilbert basis {e^ } then we can 
define an operator by prescribing its action on the basis elements. For every 
sequence of numbers {A,} C C such that Xi — > 0 we get an operator A^.j 
defined by 

A{\i} e i = A *ej. 

This operator is the norm limit of the finite rank operators A n defined by 

{ Xid if i <n 

■ 

0 if i > n 

Thus A{ y .y is compact by lemma 25.3. This type of operator is called a diagonal 
operator (with respect to the basis {ei}). 

25.2 C*-Algebras 

Definition 25.2 A C algebra of (bounded) operators on TL is a norm- 
closed subalgebra 21 of B(Tt) such that 

A* G 21 for all A G 21. 

A trivial example is the space B(TL) itself but beyond that we have the 
following simple but important example: 

Example 25.3 Consider the case where TL is L 2 (X,p) for some compact space 
X and where the measure p is a positive measure (i.e. non-negative) such 
that p{U) > 0 for all nonempty open sets U. Now for every function f G 
C(X) we have a multiplication operator Mf : L 2 (X,g) — > If 2 (X,p) defined by 
Mf(g ) = fg. The map f i— > Mf is an algebra monomorphism from C(X) into 
B(TL) := B(L 2 (X, p)) such that f i— > Mj = Mj. Thus we identify C(X) with a 
subspace of B(Tt) which is in fact a C* algebra of operators on TL. This is an 
example of a commutative C* -algebra. 

Example 25.4 (!) Using corollary 25.1 one can easily see that IC(TL) is a C* 
algebra of operators on TL. The fact that KAfTL) is self adjoint ( closed under 
adjoint A i— > A*) follows from the self adjointness of the algebra LF(TL). 

Definition 25.3 A C* algebra of operators on TL is called separable if it has a 
countable dense subset. 

Proposition 25.1 The C* algebra B(TL) itself is separable iff TL is finite di- 
mensional. 

Remark 25.1 If one gives B(TL) the strong topology then B(TL) is separable iff 
TL is separable. 

Proposition 25.2 The algebra of multipliers 971 = C(X) from example 25.3 is 
separable iff X is a separable compact space. 
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In the case that Tt is finite dimensional we may as well assume that TL = C" 
and then we can also identify B(H) with the algebra of matrices M raxn (C) where 
now A* refers to the conjugate transpose of the matrix A. On can also verify 
that in this case the “operator” norm of a matrix A is given by the maximum 
of the eigenvalues of the self-adjoint matrix A* A. Of course one also has 

Mil = \\A*A\\ 1/2 . 

25.2.1 Matrix Algebras 

It will be useful to study the finite dimensional case in order to gain experience 
and put the general case in perspective. 


25.3 Jordan-Lie Algebras 

In this section we will not assume that algebras are necessarily associative or 
commutative. We also do not always require an algebra to have a unity. If we 
consider the algebra M nx „(R) of n x n matrices and define a new product o by 

AoB ■= AB + BA ) 

then the resulting algebra M raxn (R),o is not associative and is an example of 
a so called Jordan algebra. This kind of algebra plays an important role in 
quantization theory. The general definition is as follows. 

Definition 25.4 A Jordan algebra 21, o is a commutative algebra such that 

Ao(BoA 2 ) = (AoB)oA 2 . 

In the most useful cases we have more structure which abstracts as the 
following: 

Definition 25.5 A Jordan Morphism h : 21, o — > 23,0 between Jordan algebras 
is a linear map satisfying h(A o B) = h(A) o h(B) for all A, B € 21. 

Definition 25.6 A Jordan-Lie algebra is a real vector space 21r which two 
bilinear products o and {., .} such that 
1 ) 

AoB = Bo A 
{A,B} = -{B,A} 

2) For each A € 21r the map B i— ► {A, B} is a derivation for both 21r,o and 

21k, {•, •}• 

3) There is a constant H such that we have the following associator iden- 
tity: 

{AoB)oC~ (AoB)oC= ^h 2 {{A, ,C},B}. 
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Observe that this definition actually says that a Jordan-Lie algebra is 
actually two algebras coupled by the associator identity and the requirements 
concerning the derivation property. The algebra 21r,o is a Jordan algebra and 
21 k, is a Lie algebra. 

Notice that if a Jordan-Lie algebra 21r is associative then by definition it is 
a Poisson algebra. 

Because we have so much structure we will be interested in maps which 
preserve some or all of the structure and this leads us the following definition: 

Definition 25.7 A Jordan (resp. Poisson) morphism h : 21r — » *Br between 
Jordan-Lie algebras is a Jordan morphism (resp. Lie algebra morphism) on the 
underlying algebras 21r,o and Q3r,o (resp. underlying Lie algebras 21r, {., .} and 
*Br,{., .} ). A map h : 21r — > 25r which is simultaneously a Jordan morphism 
and a Poisson morphism is called a (Jordan-Lie) morphism. In each case we 
also have the obvious definitions for isomorphism. 
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Chapter 26 


Appendices 


26.1 A. Primer for Manifold Theory 

After imposing rectilinear coordinates on a Euclidean space E n (such as the 
plane E 2 ) we identify Euclidean space with R n , the vector space of n — tuples of 
numbers. In fact, since a Euclidean space in this sense is an object of intuition 
(at least in 2d and 3d) some may insist that to be sure such a space of point 
really exists that we should in fact start with R n and “forget” the origin and 
all the vector space structure while retaining the notion of point and distance. 
The coordinatization of Euclidean space is then just a “remembering” of this 
forgotten structure. Thus our coordinates arise from a map x : E n — > R" 
which is just the identity map. This approach has much to recommend it and 
we shall more or less follow this canonical path. There is at least one regrettable 
aspect to this approach which is the psychological effect that occurs when we 
impose other coordinates on our system an introduce differentiable manifolds 
as abstract geometric objects that support coordinate systems. It might seem 
that this is a big abstraction and when the definitions of charts and atlases and 
so on appear a certain notational fastidiousness sets in that somehow creates 
a psychological gap between open set in R" and the abstract space that we 
coordinatize. But what is now lost from sight is that we have already been deal- 
ing with an abstract manifolds E n which we have identified with R” but could 
just as easily supports other coordinates such as spherical coordinates. What 
competent calculus student would waste time thinking of polar coordinates as 
given a map E 2 — > R 2 (defined on a proper open subset of course) and then 
wonder whether something like drd<j)d6 lives on the original E n or in the image 
of the coordinate map E n — > R ra ? It has become an unfortunate consequence of 
the modern viewpoint that simple geometric ideas are lost from the notation. 
Ideas that allow one to think about “quantities and their variations” and then 
comfortably write things like 


rdr A dO = dx A dy 
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without wondering if it shouldn’t be a “pullback” ip* 2 (rdr Add) = dxAdy where 
ipi 2 is the change of coordinate map R 2 — > R 2 given by 

x(r,6) = r cos 9 
y(r, 9) = r cos 9 

Of course, the experienced differential geometer understands the various mean- 
ing and the contextual understanding which removes ambiguity. The student, 
on the other hand, is faced with a pedagogy that teaches notation, trains one to 
examine each, equation for logical self consistence, but fails to teach geometric 
intuition. Having made this complain the author must confess that he too will 
use the modern notation and will not stray far from standard practice. These 
remarks are meant to encourage the student to stop and seek the simplest most 
intuitive viewpoint whenever feeling overwhelmed by notation. The student is 
encouraged to experiment with abbreviated personal notation when checking 
calculations and to draw diagrams and schematics that encode the geometric 
ideas whenever possible. “The picture writes the equations”. 

So, as we said, after imposing rectilinear coordinates on a Euclidean space 
E n (such as the plane E 2 ) we identify Euclidean space with R ra , the vector space 
of n— tuples of numbers. We will envision there to be a copy R™ of R" at each of 
its points p £ R". The elements of R™ are to be though of as the vectors based 
at p, that is, the “tangent vectors”. These tangent spaces are related to each 
other by the obvious notion of vectors being parallel (this is exactly what is not 
generally possible for tangents spaces of a manifold). For the standard basis 
vectors e.j (relative to the coordinates Xi) taken as being based at p we often 

write and this has the convenient second interpretation as a differential 

x ' p 

operator acting on smooth functions defined near p £ R n . Namely, 

d f _ df_, x 

dxi p dx.i 

An n-tuple of smooth functions X 1 ,...,^" defines a smooth vector field X = 
whose value at p is ^ X l (p) . Thus a vector field assigns to each 

* 1 p 

p in its domain, an open set U, a vector X l {p) JA- at p. We may also think 

' p 

of vector field as a differential operator via 

f„Xf£C°°(U) 

(Xf)(p) :=£*Xp)|£(p) 

Example 26.1 X = y-^A — xJ^ is a vector field defined on U = R 2 — {0} and 
(Xf)(x,y) = y%(x,y)-x^(x,y). 

Notice that we may certainly add vector fields defined over the same open 
set as well as multiply by functions defined there: 

( fX + gY){p ) = f(p)X(p) + g(p)X(p) 
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The familiar expression df = jjJ-dx-i + • • • + -g^-dx n has the intuitive inter- 
pretation expressing how small changes in the variables of a function give rise 
to small changes in the value of the function. Two questions should come to 
mind. First, “what does ‘small’ mean and how small is small enough?” Second, 
“which direction are we moving in the coordinate” space? The answer to these 
questions lead to the more sophisticated interpretation of df as being a linear 
functional on each tangent space. Thus we must choose a direction v p at p £ R" 
and then df{v p ) is a number depending linearly on our choice of vector v p . The 
definition is determined by dxi(ej) = Sij. In fact, this shall be the basis of our 
definition of df at p. We want 

DI Al t [>=-£<->■ 

Now any vector at p may be written v p = ^" =1 v l which invites us to 

x ‘ p 

use Up as a differential operator (at p): 

v vf eR 


This consistent with our previous statement about a vector field being a differ- 
ential operator simply because X ( p ) = X p is a vector at p for every p £ U. This 
is just the directional derivative. In fact we also see that 

A/Ip (v P ) = Y Ur.(p) dx 3 (Y yl xx. ] 


= Y vi n!rW = v rf 


so that our choices lead to the following definition: 


Definition 26.1 Let f be a smooth function on an open subset U o/R". By 
the symbol df we mean a family of maps Df | with p varying over the domain 
U of f and where each such map is a linear functional of tangent vectors based 
at p given by Df\ p (v p ) = v p f = £" =1 v l i ^-(p). 

Definition 26.2 More generally, a smooth 1-form a on U is a family of linear 
functionals a p : T p R” — * R with p £ U which is smooth is the sense that 

a p( ITT ) a smooth function of p for all i. 

' p 

From this last definition it follows that if X = X 1 JD is a smooth vector field 

OXi 

then a(X)(p) a p (X p ) defines a smooth function of p. Thus an alternative 
way to view a 1— form is as a map a : X i— > a(X) which is defined on vector 
fields and linear over the algebra of smooth functions C°°(U ) : 

at(fX + gY) = fa(X) + ga(Y). 
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26.1.1 Fixing a problem 

Now it is at this point that we want to destroy the privilege of the rectangular 
coordinates and express our objects in an arbitrary coordinate system smoothly 
related to the existing coordinates. This means that for any two such coordinate 
systems, say w 1 , u n and y 1 , y n we want to have the ability to express fields 
and forms in either system and have for instance 


yi 

' {y) d Vi 


= X = XI 


(M) dm 


for appropriate functions X l (y y X l (u y This equation only makes sense on the 
overlap of the domains of the coordinate systems. To be consistent with the 
chain rule we must have 

d du j d 

dy l dy l dvd 

which then forces the familiar transformation law: 


V du 3 X i - X‘ 
zL ~d^ x (v) ~ A («) 


We think of X^ and X^ as referring to or representing the same geometric 
reality from two different coordinate systems. No big deal right? We how about 
the fact, that there is this underlying abstract space that we are coordinatizing? 
That too is no big deal. We were always doing it in calculus anyway. What about 
the fact that the coordinate systems aren’t defined as a 1-1 correspondence with 
the points of the space unless we leave out some point in some coordinates like we 
leave out the origin to avoid ambiguity in 6 and have a nice open domain. Well 
if this is all fine then we may as well imagine other abstract spaces that support 
coordinates in this way. In fact, we don’t have to look far for an example. 
Any surface such as the sphere will do. We can talk about 1-forms like say 
a = Qd</> + </>sin (9)d0, or a vector held tangent to the sphere 0sin((j))Jjg + O 2 -^ 
and so on (just pulling things out of a hat). We just have to be clear about 
how these arise and most of all how to change to a new coordinate expression 
for the same object. This is the approach of tensor analysis. An object called a 
2-tensor T is represented in two different coordinate systems as for instance 


'W T >:i JL 

f ( y ) 


d 

dy j 




) du 1 


d 

dui 


where all we really need to know for many purposes the transformation law 


T*] _ 

1 (v) ~ 


v- r rs d V l dy l 
^ {u) du r du s ‘ 


Then either expression is referring to the same abstract tensor T. This is just 
a preview but it highlight the approach wherein a transformation laws play a 
defining role. 
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26.2 B. Topological Spaces 

In this section we briefly introduce the basic notions from point set topology 
together with some basic examples. We include this section only as a review 
and a reference since we expect that the reader should already have a reasonable 
knowledge of point set topology. In the Euclidean coordinate plane R” consisting 
of all n-tuples of real numbers (xi,x 2 , ■■■x n ) we have a natural notion of distance 
between two points. The formula for the distance between two points p\ = 
(xi,X 2 ,—x n ) and p 2 = (yi, y 2 , —,y n ) is simply 

d(pi,P 2 ) = - yi) 2 . (26.1) 

The set of all points of distance less than e from a given point po in the plain 
is denoted B(po,e), i.e. 

B(po,e) = {p € R" : d(p,p 0 ) < e}. (26.2) 

The set B(p 0 , e ) is call the open ball of radius e and center p 0 . A subset S of R 2 
is called open if every one of its points is the center of an open ball completely 
contained inside S. The set of all open subsets of the plane has the property 
that the union of any number of open sets is still open and the intersection of 
any finite number of open sets is still open. The abstraction of this situation 
leads to the concept of a topological space. 

Definition 26.3 A set X together with a family ‘X of subsets of X is called a 
topological space if the family X has the following three properties. 

1. X € X and 0 € X. 

2. IfUi and U 2 are both in X then U\ D U 2 € X also. 

3. If {U a } a( zA is any sub-family of X indexed by a set A then the union 
UaeA U a is also in X. 

In the definition above the family of subsets X is called a topology on X 
and the sets in X are called open sets. The compliment U c := X \ U of an 
open set U is called closed set. The reader is warned that a generic set may 
be neither open nor closed. Also, some subsets of X might be both open and 
closed (consider X itself and the empty set) . A topology X 2 is said to be finer 
than a topology Xi if Xi C X 2 and in this case we also say that Xi is coarser 
than X 2 . We also say that the topology Xi is weaker than X 2 and that X 2 is 
stronger than Xi. 

Neither one of these topologies is generally very interesting but we shall soon 
introduce much richer topologies. A fact worthy of note in this context is the 
fact that if A, X is a topological space and S C X then S inherits a topological 
structure from X. Namely, a topology on S (called the relative topology) is 
given by 

Xs = {all sets of the form SOT where T £ X} 

In this case we say that S' is a topological subspace of X. 


(26.3) 
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Definition 26.4 A map between topological spaces f : X — > Y is said to be 
continuous at p £ X if for any open set O containing f(p) there is an open 
set U containing p £ X such that f(U) C O. A map f : X — > Y is said to be 
continuous if it is continuous at each point p £ X. 

Proposition 26.1 / : X — > Y is continuous iff / -1 (0) is open for every open 
set O C Y. 

Definition 26.5 A subset of a topological space is called closed if it is the 
compliment of an open set. 

Closed sets enjoy properties complimentary to those of open sets: 

1. The whole space X and the empty set 0 are both closed. 

2. The intersection of any family of closed sets is a closed set. 

3. The union of a finite number of closed sets is closed. 

Since the intersection of closed sets is closed every set S C X is contained in 
a closed set which is the smallest of all closed sets containing S which is called 
the closure of S and is denoted by S. The closure S is the intersection of all 
closed subsets containing S: 

n 

SdF 

Similarly, the interior of a set S is the largest open set contained in S and is 

O 

denoted by S. A point p £ S C X is called an interior point of S if there is 
an open set containing p and contained in S. The interior of S is just the set of 

O 

all its interior points, ft may be shown that S = ( S c ) c 

Definition 26.6 The (topological) boundary of a set S C X is dS := S fl S c 
and 

We say that a set S C A is dense in X if S = X. 

Definition 26.7 A subset of a topological space X,T is called cloven if it is 
both open and closed. 

Definition 26.8 A topological space X is called connected if it is not the 
union of two proper cloven set. Here, proper means not X or 0. A topological 
space X is called path connected if for every pair of points p,q £ X there is a 
continuous map c : [a, b] — > X (a path) such that c(a) = q and c(b ) = p. (Here 
[a, b] C R. is endowed with the relative topology inherited from the topology on 
R.j 

Example 26.2 The unit sphere S 2 is a topological subspace of the Euclidean 
space R 3 . 
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Let X be a set and {T a .} a &A any family of topologies on X indexed by some 
set A. The the intersection 

1 = p) T q 

a€A 

is a topology on X. Furthermore, T is coarser that every T a . 

Given any family $ of subsets of X there exists a weakest (coarsest) topology 
containing all sets of S'. We will denote this topology by T(S)- 

One interesting application of this is the following; Given a family of maps 
{fa} from a set 5 to a topological space Y,Ty there is a coarsest topology on 
S such that all of the maps f a are continuous. This topology will be denoted 
T {/o} and is called the topology generated by the family of maps {/«}. 

Definition 26.9 If X and Y are topological spaces then we define the product 
topology on XxY as the topology generated by the projections pr\ : XxY — > X 
and pr 2 : X xY — > Y. 

Definition 26.10 If tt : X — > Y is a surjective map where X is a topological 
space but Y is just a set. Then the quotient topology is the topology generated 
by the map tt. In particular, if Ad X we may form the set of equivalence classes 
X/A where x ~ y if both are in A or they are equal. The the map x i— > [x] is 
surjective and so we may form the quotient topology on X/A. 

Let X be a topological space and x £ X. A family of subsets B x all of 
which contain x is called an open neighborhood base at x if every open set 
containing x contains (as a superset) an set from B x . If X has a countable open 
base at each x £ X we call X first countable. 

A subfamily B is called a base for a topology T on X if the topology T is 
exactly the set of all unions of elements of B. If X has a countable base for its 
given topology we say that A is a second countable topological space. 

By considering balls of rational radii and rational centers one can see that 
R" is first and second countable. 

26.2.1 Separation Axioms 

Another way to classify topological spaces is according to the following scheme: 

(Separation Axioms) 

A topological space X, T is called a T 0 space if given x, y € X, x ^ y, there 
exists either an open set containing x, but not y or the other way around (We 
don’t get to choose which one). 

A topological space X , T is called Ti if whenever given any x,y £ X there is 
an open set containing x but not y (and the other way around;we get to do it 
either way). 

A topological space X , T is called T 2 or Hausdorff if whenever given any 
two points x, y £ X there are disjoint open sets U\ and U 2 with x £ U\ and 

y e u 2 . 



364 


CHAPTER 26. APPENDICES 


A topological space X, T is called X3 or regular if whenever given a closed 
set F C X and a point x € X\F there are disjoint open sets U\ and U 2 with 
x £ Ui and F C U 2 

A topological space X, T is called T4 or normal if given any two disjoint 
closed subsets of X, say F\ and F 2 . there are two disjoint open sets U\ and U 2 
with F\ C U\ and F 2 CU 2 . 

Lemma 26.1 (Urysohn) Let X be normal and F,G C X closed subsets with 
F fl G = 0. Then there exists a continuous function f : X — > [0, 1] C ffi. such 
that f(F) = 0 and f(G) = 1. 

A open cover of topological space X (resp. subset S C X) a collection of 
open subsets of X, say{I/ a }, such that X = \JU a (resp. S C (J U a ). For 
example the set of all open disks of radius e > 0 in the plane covers the plane. 
A finite cover consists of only a finite number of open sets. 

Definition 26.11 A topological space X is called compact if every open cover 
of X can be reduced to a finite open cover by eliminating some ( possibly an 
infinite number) of the open sets of the cover. A subset S C X is called compact 
if it is compact as a topological subspace (i.e. with the relative topology). 

Proposition 26.2 The continuous image of a compact set is compact. 

26.2.2 Metric Spaces 

If the set X has a notion of distance attached to it then we can get an associated 
topology. This leads to the notion of a metric space. 

A set X together with a function d : X x X — > K. is called a metric space 
if 

d(x, x) > 0 for all x £ X 
d(x, y) = 0 iff x = y 

d(x,z) < d(x,y) + d(y,z) for any x,y,z € X ( this is called the triangle 
inequality). 

The function d is called a metric or a distance function. 

Imitating the situation in the plane we can define the notion of an open ball 
B(po, e) with center po and radius e. Now once we have the metric then we have 
a topology; we define a subset of a metric space X, d to be open if every point 
of S is an interior point where a point p £ S is called an interior point of S 
if there is some ball B(p,e) with center p and (sufficiently small) radius e > 0 
completely contained in S. The family of all of these metrically defined open 
sets forms a topology on X which we will denote by T c ;. It is easy to see that 
any B(p , e ) is open according to our definition. 

If / : X, d — ► Y, p is a map of metric spaces then / is continuous at x £ X if 
and only if for every e > 0 there is a, 5(e) > 0 such that if d(x',x) < 6 (e) then 
P(f(x')j(x)) < e. 
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Definition 26.12 A sequence of elements Xi,X 2 , of a metric space X,d 

is said to converge to p if for every e > 0 there is an N(e) > 0 such that 

if k > N(e) then Xk G B(p,e) . A sequence X\,X 2 , is called a Cauchy 

sequence is for every e > 0 there is an N(e) > 0 such that if k, l > N(e) 
then d(xk,Xi) < e. A metric space X,d is said to be complete if every Cauchy 
sequence also converges. 

A map / : X, d — > Y, p of metric spaces is continuous at x G X if and only 
if for every sequence Xi converging to x, the sequence yi := f(xf) converges to 

f( x )- 

26.3 C. Topological Vector Spaces 

We shall outline some of the basic definitions and theorems concerning topolog- 
ical vector spaces. 

Definition 26.13 A topological vector space (TVS) is a vector space V with 
a Hausdorff topology such that the addition and scalar multiplication operations 
are (jointly) continuous. 

Definition 26.14 Recall that a neighborhood of a point p in a topological space 
is a subset which has a nonempty interior containing p. The set of all neighbor- 
hoods that contain a point x in a topological vector space V is denoted A f(x) . 

The families A f{x) for various x satisfy the following neighborhood axioms 

1. Every set which contains a set from Af(x) is also a set from Af(x) 

2. If Ni is a family of sets from Af(x) then P| - Ni G A f(x) 

3. Every N G A f(x) contains x 

4. If V G A f(x) then there exists W G Af (x) such that for all y G W, 
V G A f(y). 

Conversely, let X be some set. If for each x G X there is a family A f(x) of 
subsets of X that satisfy the above neighborhood axioms then there is a uniquely 
determined topology on X for which the families Af(x) are the neighborhoods 
of the points x. For this a subset U C X is open iff for each x G U we have 
U G Af (x). 

Definition 26.15 A sequence x n in a TVS is call a Cauchy sequence iff for 
every neighborhood U of 0 there is a number Nu such that Xi — Xk G U for all 
k, l > Njj . 

Definition 26.16 A relatively nice situation is when V has a norm which in- 
duces the topology. Recall that a norm is a function |||| : v e- > ||w|| g R defined 
on V such that for all v, w G V we have 
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1. ||v|| > 0 and j|u|| = 0 iff v = 0, 

2. \\v + w\\ < ||v|| + \\w\\, 

3. ||av|| = |a| ||u|| for all a£l. 

In this case we have a metric on V given by dist(r>, tu):=||i> — ie|| . A semi- 
norm is a function || || : v i— > ||v|| £ R. such that 2) and 3) hold but instead 
of 1) we require only that ||u|| > 0. 

Definition 26.17 A normed space V is a TVS which has a metric topology 
given by a norm. That is the topology is generated by the family of all open balls 

B y (x,r) :={xeV: ||x|| < 0}. 

Definition 26.18 A linear map I : V — ■> W between normed spaces is called 
bounded iff there is a constant C such that for all v € V we have ||A/|| W < 
C || v|| v . If I is bounded then the smallest such constant C is 

IKII : = su p xth = su p{INIw : ll v l!v < !} 

II v ll V 

The set of all bounded linear maps V — » W is denoted £>(V, W). The vector 
space B (\/ , W) is itself a normed space with the norm given as above. 

Definition 26.19 A locally convex topological vector space V is a TVS 

such that it’s topology is generated by a family of seminorms {||.|| a } a . This 
means that we give V the weakest topology such that all ||.|| a are continuous. 
Since we have taken a TVS to be Hausdorff we require that the family of semi- 
norms is sufficient in the sense that for each x € V we have p|{a; : IML = 
0} = 0. A locally convex topological vector space is sometimes called a locally 
convex space and so we abbreviate the latter to LCS. 

Example 26.3 Let O be an open set in K™ or any manifold. For each x £ O 
define a seminorm p x on C(Q) by p x (f) = f(x). This family of seminorms 
makes C(Cl) a topological vector space. In this topology convergence is pointwise 
convergence. Also, C(Ct) is not complete with this TVS structure. 

Definition 26.20 An LCS which is complete (every Cauchy sequence converges) 
is called a Frechet space. 

Definition 26.21 A complete normed space is called a Banach space. 

Example 26.4 Suppose that X,p is a a -finite measure space and let p > 1. 
The set L p (X,p) of all with respect to measurable functions f : X — * C such 
that J \f\ p dp. <oo is a Banach space with the norm ||/|| := (/ |/| p dp) l ^ v . 
Technically functions equal almost everywhere dp must be identified. 

Example 26.5 The space Cb(fl) of bounded continuous functions on O is a 
Banach space with norm given by H/H^ := sup^g^ |/(a;)|. 
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Example 26.6 Once again let O be an open subset o/R". For each compact 
K CC O we have a seminorm on C(Q) defined by f i— > \\f\\ K := sup x£if \f(x)\. 
The corresponding convergence is the uniform convergence on compact subsets 
ofLl. It is often useful to notice that the same topology can be obtained by using 
||/|| x . obtained from a countable sequence of nested compact sets K\ C K 2 C ... 
such that 

U K n = Q. 

Such a sequence is called an exhaustion ofLl. 

If we have topological vector space V and a closed subspace S, then we can 
form the quotient V /S. The quotient can be turned in to a normed space by 
introducing as norm 

IIMIIv/s : = inf INI- 

v£ [ccj 

If S is not closed then this only defines a seminorm. 

Theorem 26.1 If V is Banach space and a closed subspace S a closed (linear) 
subspace then V/S is a Banach space with the above defined norm. 

Proof. Let x n be a sequence in V such that [x n \ is a Cauchy sequence in 

V/S. Choose a subsequence such that \\[x n ] — [x n +i]|| < 1/2” for n = 1,2, 

Setting si equal to zero we find s 2 € S such that ||a;i — (x 2 + S 2 ) || and continuing 
inductively define a sequence s, such that such that {x n + s„} is a Cauchy 
sequence in V. Thus there is an element y € V with x n + s n — > y. But since 
the quotient map is norm decreasing the sequence [x n + s„] = [x n ] must also 
converge; 

[x n \ -> [y]- 


Remark 26.1 It is also true that if S is closed and V/S is a Banach space then 
so is V. 

26.3.1 Hilbert Spaces 

Definition 26.22 A Hilbert, space H is a complex vector space with a Hermi- 
tian inner product (., .). A Hermitian inner product is a bilinear form with the 
following properties: 

1 ) ( v , w) = ( v , w) 

2 ) ( v , awi + ( 3 w 2 ) = a(v, nq) + /3(v, w 2 ) 

3) ( v , v) > 0 and ( v , v) = 0 only if v = 0. 

One of the most fundamental properties of a Hilbert space is the projection 
property. 
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Theorem 26.2 If K is a convex, closed subset of a Hilbert space H. then for 
any given x £ Ti there is a unique element pk{x) £ H which minimizes the 
distance ||x — y\\over y £ K. That is 

\\x-p K {x)\\ = inf ' \\x- y\\ . 

yGK 

If K is a closed linear subspace then the map x i— > p K ( x) is a bounded linear 
operator with the projection property p 2 K = pk- 

Definition 26.23 For any subset S £ ft we have the orthogonal compliment 
S 1 - defined by 

S ± = {x £ TL : (x, s) = 0 for all s £ Sj. 

S ± is easily seen to be a linear subspace of H. Since l s : x e- > (x, s) is 
continuous for all s and since 


s ± = njf 1 (0) 

we see that S 1 - is closed. Now notice that since by definition 
||ar - P s x|| 2 < \\x - P s x - As|| 2 

for any s £ S and any real A we have ||ar — P s a;||“ < \\x — P s x\\ 2 —2\(x — P s x, s) + 
A 2 ||s|| 2 . Thus we see that p( A) := ||ar — P s ;r|| 2 — 2X(x — P s x, s) + A 2 ||s|| 2 is a 
polynomial in A with a minimum at A = 0. This forces (x — P s x, s) = 0 and 
so we see that x — P s x. From this we see that any x £ H can be written 
as x = x — P s x + P s x = s + s 2 -. On the other hand it is easy to show that 
S' 1 fl S = 0. Thus we have H = S ® S- 1 for any closed linear subspace S C H. 
In particular the decomposition of any lass+^gS® S 1 - is unique. 

26.3.2 Orthonormal sets 

26.4 D. Overview of Classical Physics 

26.4.1 Units of measurement 

In classical mechanics we need units for measurements of length, time and mass. 
These are called elementary units. WE need to add a measure of electrical 
current to the list if we want to study electromagnetic phenomenon. Other 
relevant units in mechanics are derived from these alone. For example, speed 
has units of length x time -1 , volume has units of length x length x length kinetic 
energy has units of mass x length x length x length x time - 1 x time -1 and so on. 
A common system, called the SI system uses meters (m), kilograms (km) and 
seconds (sec) for length, mass and time respectively. In this system, the unit of 
energy kgxm 2 sec -2 is called a joule. The unit of force in this system is Newton’s 
and decomposes into elementary units as kgxmxsec -2 . 
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26.4.2 Newton’s equations 

The basic assumptions of Newtonian mechanics can be summarized by saying 
that the set of all mechanical events M taking place in ordinary three dimen- 
sional space is such that we can impose on this set of events a coordinate system 
called an inertial coordinate system. An inertial coordinate system is first of all 
a 1-1 correspondence between events and the vector space IxM 3 consisting of 
4-tuples ( t,x,y,z ). The laws of mechanics are then described by equations and 
expressions involving the variables ( t,x,y,z ) written (f, x) where ~x.={x,y, z). 
There will be many correspondences between the event set and R x R 3 but not 
all are inertial. An inertial coordinate system is picked out by the fact that 
the equations of physics take on a particularly simple form in such coordinates. 
Intuitively, the x, y, z variables locate an event in space while t specifies the 
time of an event. Also, x, y,z should be visualized as determined by measur- 
ing against a mutually perpendicular set of three axes and t is measured with 
respect to some sort of clock with t = 0 being chosen arbitrarily according to 
the demands of the experimental situation. Now we expect that the laws of 
physics should not prefer any particular such choice of mutually perpendicular 
axes or choice of starting time. Also, the units of measurement of length and 
time are conventionally determined by human beings and so the equations of 
the laws of physics should in some way not depend on this choice in any sig- 
nificant way. Careful consideration along these lines leads to a particular set 
of “coordinate changes” or transformations which translate among the different 
inertial coordinate systems. The group of transformations which is chosen for 
classical (non-relativistic) mechanics is the so called Galilean group Gal. 

Definition 26.24 A map j:Rxl 3 -+KxR 3 ts called a Galilean transforma- 
tion iff it can be decomposed as a composition of transformations of the following 
type: 

1. Translation of the origin: 


(f,x) I— + t 0 , X + x 0 ) 

2. Uniform motion with velocity v: 

(f,x) i— > (f,x+iv) 

3. Rotation of the spatial axes: 


(t, x) i— > (t, f?x) 


where il £ 0(3). 

If (t, x) are inertial coordinates then so will (T, X) be inertial coordinates iff 
(T, X) =g(t,x) for some Galilean transformation. We will take this as given. 

The motion of a idealized point mass moving in space is described in an 
inertial frame (t, x) as a curve 1 1— > c(t) £ R 3 with the corresponding curve 1 1— > 
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( t,c(t )) in the (coordinatized) event space Rxl 3 . We often write x(t) instead 
of c(t) . If we have a system of n particles then we may formally treat this as a 
single particle moving in an 3n— dimensional space and so we have a single curve 
in R 3n . Essentially we are concatenating the spatial part of inertial coordinates 
R 3 " = K 3 x • • • K 3 taking each factor as describing a single particle in the system 
so we take x = (xi,yi,z±, x n , y n , z n ). Thus our new inertial coordinates may 
be thought of as M x R 3n . If we have a system of particles it will be convenient to 
define the momentum vector p = (miXi, m\y\, m\Zi, ....,m n x n ,m n y n ,m n z n ) £ 
R 3n . In such coordinates, Newton’s law for n particles of masses 


reads 


d 2 p 
dt 2 


F(x(f),f) 


where t i— > x(f) describes the motion of a system of n particles in space as a 
smooth path in M 3n parameterized by t representing time. The equation has 
units of force (Newton’s in the SI system). If all bodies involved are taken into 
account then the force F cannot depend explicitly on time as can be deduced 
by the assumption that the form taken by F must be the same in any inertial 
coordinate system. We may not always be able to include explicitly all involved 
bodies and so it may be that our mathematical model will involve a changing 
force F exerted on the system from without as it were. As an example consider 
the effect of the tidal forces on sensitive objects on earth. Also, the example of 
earths gravity shows that if the earth is not taken into account as one of the 
particles in the system then the form of F will not be invariant under all spatial 
rotations of coordinate axes since now there is a preferred direction (up-down) . 


26.4.3 Classical particle motion in a conservative field 

There are special systems for making measurements that can only be identified 
in actual practice by interaction with the physical environment. In classical 
mechanics, a point mass will move in a straight line unless a force is being 
applied to it. The coordinates in which the mathematical equations describing 
motion are the simplest are called inertial coordinates (a;, y, z, t). If we consider 
a single particle of mass in then Newton’s law simplifies to 

d 2 x _. . . . 

m-^ = F(x(t), t) 

The force F is conservative if it doesn’t depend on time and there is a potential 
function V : R 3 — > R such that F(x) = — gradF(x). Assume this is the case. 
Then Newton’s law becomes 

d 2 

m—^x(t) + grad F(x(f)) = 0. 

Newton’s equations are often written 


F(x(<)) = ma(t) 
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F :R 3 — » R 3 is the force function and we have taken it to not depend explicitly 
on time t. The force will be conservative so F(x) = — grad V (x) for some scalar 
function F(x). The total energy or Hamiltonian function is a function of two 
vector variables x and v given (in this simple situation) by 

#( x , v ) = ^ TO II v|| 2 + V(x) 


so that if we plug in x = x(t) and v = x'(t) for the motion of a particle then 
we get the energy of the particle. Since this is a conservative situation F(x) = 

— grad V (x) we discover by differentiating and using equation ?? that 4iJ(x(i), x'(f)) 
0. This says that the total energy is conserved along any path which is a solution 
to equation ?? as long as F(x) = — gradF(x). 

There is a lot of structure that can be discovered by translating the equations 
of motion into an arbitrary coordinate system {q 3 ,q 2 ,q 3 ) and then extending 
that to a coordinate system {q l , q 2 , q 3 , q 1 , q 2 , q 3 ) for velocity space R 3 x R 3 . 
Here, q 3 ,q 2 , q 3 are not derivatives until we compose with a curve R — ■> R 3 to 
get functions of t. Then (and only then) we will take q l {t), q 2 (t),q 3 (t) to be the 
derivatives. Sometimes (q 1 (t ) , q 2 (t ) , q 3 (t)) is called the generalized velocity 
vector. Its physical meaning depends on the particular form of the generalized 
coordinates. 

In such a coordinate system we have a function L(q, q) called the Lagrangian 
of the system. Now there is a variational principle that states that if q(f) is 
a path which solve the equations of motion and defined from time t\ to time 
t ‘2 then out of all the paths which connect the same points in space at the same 
times t-\ and O , the one that makes the following action the smallest will be 
the solution: 

S(q(t)) = [ L(q,q)dt 

Jti 

Now this means that if we add a small variation to q get another path q + <5q 
then we calculate formally (and rigorously later in the notes): 


5S(q(t)) = S / T(q, q)dt 

Jt i 

rt-2 


f ti 


Sq-^L^cd+dq-^Li^q) 


dt 


d 


d d 


l tl * [dq LM) ~ dtM LM) ' dt ' 


1 *2 


*q-^( q , q) 


If our variation is among those that start and end at the same space-time loca- 
tions then <5q = 0 is the end points so the last term vanishes. Now if the path 
q(t) is stationary for such variations then <55(q(t)) = 0 so 




L{ q, q) 


d d 
dt dq 


L( q, q) 


dt = 0 
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and since this is true for all such paths we conclude that 


d_ 

dq 


L{ q,q) 


d_ d_ 
dt dq 


L(q, q) = 0 


or in indexed scalar form 


dL_ 

dq 1 


A|^ =0 for 1 < i < 3 
dt dq 1 ~ ~ 


on a stationary path. This is (these are) the Euler-Lagrange equation(s). If 
q were just rectangular coordinates and if L were | to || v || 2 — E(x) this turns 
out to be Newton’s equation. Notice, the minus sign in front of the V. 


Definition 26.25 For a Lagrangian L we can associate the quantity E = Y) 

L{ q,q). 


Let us differentiate E. We get 


d 

dt 


E = 


d x - dL 
dt dq 1 
dL d , 


q l - L( q, q) 


, ■ d dL d 

Wdt q - 9 dtaf - S L(q ' q) 


dL d 


d dL dL 


dq 1 d^ ^ dt dq 1 


dq 1 ^ 


9Id„ 

dq 1 dt ^ 


0 by the Euler Lagrange equations. 


(26.4) 


Conclusion 26.1 If L does not depend explicitly on time; ^ = 0, then the 
energy E is conserved ; = 0 along any solution of the Euler-Lagrange 

equations. . 

But what about spatial symmetries? Suppose that JjEL = 0 for one of the 
coordinates q l . Then if we define pi = we have 

d _ d dL _ d 
dt Pt dt dq 1 dq 1 

so Pi is constant along the trajectories of Euler’s equations of motion. The 
quantity Pi = ^ is called a generalized momentum and we have reached the 
following 

Conclusion 26.2 If J=LL = 0 thenpi is a conserved quantity. This also applies 
if FqL = ( J^f , ■■■> Jy^) = 0 with the conclusion that the vector p = f^L = 
(J^-,..., Jjpj) is conserved (each component separately). 

Now let us apply this to the case a free particle. The Lagrangian in rectan- 
gular inertial coordinates are 

£( x , x ) = |x| 2 



26.4. D. OVERVIEW OF CLASSICAL PHYSICS 


373 


and this Lagrangian is symmetric with respect to translations xnx + c 

L(x + c, x) = L(x, x) 

and so the generalized momentum vector for this is p =mx each component of 
which is conserved. This last quantity is actually the usual momentum vector. 

Now let us examine the case where the Lagrangian is invariant with respect 
to rotations about some fixed point which we will take to be the origin of an 
inertial coordinate system. For instance suppose the potential function V (x) is 
invariant in the sense that V(x) = V(Ox) for any orthogonal matrix O. The we 
can take an antisymmetric matrix A and form the family of orthogonal matrices 
e sA . The for the Lagrangian 

£(x,x) = ^m|x| 2 - V(x) 

we have 

b"*l 2 - F(e ‘" x » 

= ^(^ TO I *| 2 - y ( x )) = 0 

On the other hand, recall the result of a variation <5q 

*>• - j,-k u ^£ dt+ 

what we have done is to let <5q =Aq since to first order we have e s ' 4 q = 1 + sAq. 
But if q(f) satisfies Euler’s equation then the integral above is zero and yet the 
whole variation is zero too. We are led to conclude that 

T d l * 2 

<5q-wr-b(q,q) =o 

L c'q J tl 

which in the present case is 

j 4 x ‘^(^ m l A | 2_y ( x) ) = ° 

J ti 

[mix • x]j 2 =0 

for all <2 and t-\ . Thus the quantity mAx ■ x is conserved. Let us apply this 
with A equal to the following in turn 

' 0-10 

A= 1 0 0 

0 0 0 


then we get mAx ■ x =m(—x 2 , x 1 , 0) • (a: 1 , x 2 , x 3 ) — m(x 1 x 2 — x 3 x 2 ) which is the 
same as mx xk = px k which is called the angular momentum about the k 
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axis ( k = (0, 0, 1) so this is the z-axis) and is a conserved quantity. To see the 
point here notice that 


e 


tA 


cos t — sin t 0 
sin t cos £ 0 

0 0 1 


is the rotation about the z axis. We can do the same thing for the other two 
coordinate axes and in fact it turns out that for any unit vector u the angular 
momentum about that axis defined by p x u is conserved. 


Remark 26.2 We started with the assumption that L was invariant under all 
rotations O but if it had only been invariant under counterclockwise rotations 
about an axis given by a unit vector u then we could still conclude that at least 
p x u is conserved. 

Remark 26.3 Let begin to use the index notation (like q\pi and x l etc.) a 
little more since it will make the transition to fields more natural. 

Now we define the Hamiltonian function derived from a given Lagrangian 
via the formulas 


#(q,p) = 5Z P’i 1 - L( q,q) 
dL 

Pi ~ M 


where we think of q as depending on q and p via the inversion of pi = . Now 

it turns out that if q(£),q(£) satisfy the Euler Lagrange equations for L then 
q(£) and p(£) satisfy the Hamiltonian equations of motion 


dq l _ dH 
dt dpt 
dp 1 _ dH 
dt dq l 


One of the beauties of this formulation is that if Q l = Q l (q J ) are any other 
coordinates on R 3 and we define P l = then taking H(..q z ..p l ..) = 

H(..Q l ..Pi..) the equations of motion have the same form in the new co- 
ordinates. More generally, if Q,P are related to q,p in such a way that the 
Jacobian matrix J of the coordinate change ( on R 3 x R 3 ) is symplectic 


J* 


' 0 I 


' 0 

I ' 

-I 0 

J = 

-I 

0 


then the equations 26.4.3 will hold in the new coordinates. These kind of coordi- 
nate changes on the q , p space R 3 x R 3 (momentum space) are called canonical 
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transformations. Mechanics is, in the above sense, invariant under canonical 
transformations. 

Next, take any smooth function f{q,p) on momentum space (also called 
phase space). Such a function is called an observable. Then along any solution 
curve ( q(t),p(t )) to Hamilton’s equations we get 

df_ _ dj^dq df^dp 
dt, dq dt dp dt 
_ Of DH df dH 

dgi Qpi Qpi Qp 

= \f,H] 

where we have introduced the Poisson bracket [/, H] defined by the last equality 
above. So we also have the equations of motion in the form ^ = [f,H] for any 
function / not just the coordinate functions q and p. Later, we shall study a 
geometry hiding here; Symplectic geometry. 

Remark 26.4 For any coordinate t, x we will often consider the curve (x(f), x'(t)) £ 
R 3 " x R 3n the latter product space being a simple example of a velocity phase 
space. 

26.4.4 Some simple mechanical systems 

1. As every student of basic physics know the equations of motion for a par- 
ticle falling freely through a region of space near the earths surface where 
the force of gravity is (nearly) constant is x"(f) = —g k where k is the 
usual vertical unit vector corresponding to a vertical 2 -axis. Integrating 
twice gives the form of any solution x(f) = — ^gt 2 k+fvo + xo for constant 
vectors xo, Vo £ R 3 . We get different motions depending on the initial con- 
ditions (xo,vo). If the initial conditions are right, for example if Vo = 0 
then this is reduced to the one dimensional equation x"(t) = —g. The 
path of a solution with initial conditions (xo,i’o) is given in phase space 
as 

t (-- gt 2 + tv o + Xq, - gt + v 0 ) 

and we have shown the phase trajectories for a few initial conditions. 
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Phase portrait-falling object 


2. A somewhat general 1-dimensional system is given by a Lagrangian of the 
form 

L = ^a(q)q 2 - V(q) (26.5) 

and example of which is the motion of a particle of mass to along a 
1-dimensional continuum and subject to a potential V(x). Then the 
Lagrangian is L = ^mx 2 — V{x). Instead of writing down the Euler- 
Lagrange equations we can use the fact that E = — L(x,x) = 

mx 2 — (| mx 2 — V(x)) = \ mi 2 + V(x) is conserved. This is the total 
energy which is traditionally divided into kinetic energy ^mx 2 and poten- 
tial energy V(x). We have E = \mx 2 + V(x) for some constant. Then 

dx _ /2E-2V(x) 
dt V to 

and so 

t = J m/2 / = + c. 

J y/E - V{x) 

Notice that we must always have E — V (x) > 0. This means that if V(x) 
has a single local minimum between some points x = a and x = b where 
E — V = 0, then the particle must stay between x = a and x = b moving 
back and forth with some time period. What is the time period?. 


3. 


Central Field. A 

form F(x) = 
central field is 


central field is typically given by a potential of the 
Thus the Lagrangian of a particle of mass to in this 



where we have centered inertial coordinates at the point where the po- 
tential has a singularity lim x ^ 0 V (x) = ±oo. In cylindrical coordinates 
(r, 9, z ) the Lagrangian becomes 


1 

2 


m(r 2 + r 2 0 2 + z 2 ) 


k 

( r 2 + -2)1/2- 


We are taking q 1 = r , q 2 = 9 and q 3 = 9. But if initially z = z = 0 then 
by conservation of angular momentum discussed above the particle stays 
in the z = 0 plane. Thus we are reduced to a study of the two dimensional 
case: 

-m(r 2 + r 2 9 2 ) + 

2 r 

What are Lagrange’s equations? Answer: 

dL _ d dL 
dq 1 dt dq 1 

= mr9 2 — mrr 



26.4. D. OVERVIEW OF CLASSICAL PHYSICS 


377 


and 


dL d dL 

dq 2 dt. Dq 2 

= —mr 2 00. 

The last equation reaffirms that 9 = ixq is constant. Then the first equation 
becomes mruiQ — -% — mfr = 0. On the other hand conservation of energy 
becomes 


4. 


1 / -2 

-to r 
2 v 


on. ^ T-i f / . 

r w 0 ) + - = E 0 = -m(r { 
r 1 


2 2 \ ^ 
r o w o) + ~ 


•2 . 2 2 
r + r lo 0 


2k 


or 

2i?o 

TO 


,2 

5. A simple oscillating system is given by A# = —a: which has solutions of 
the form x ( t ) = Ci cos t + C 2 sin t. This is equivalent to the system 

x' = v 
v' = — x 


6. Consider a single particle of mass m which for some reason is viewed with 
respect to rotating frame and an inertial frame (taken to be stationary). 
The rotating frame (Ei(<), E 2 (<), E 3 (f)) = E (centered at the origin of R 3 ) 
is related to stationary frame (61^2,63) = e by an orthogonal matrix O : 

E (t) = O (f)e 

and the rectangular coordinates relative to these frames are related by 

x(t) = O (t)X(t) 
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We then have 


x(i) = 0(t)X + 0(t)X 

= o(t)(x + n(t)x) 

where Q(t) = 0*(f)0(t) is an angular velocity. The reason we have chosen 
to work with fl(t) rather than directly with O (t) will become clearer later 
in the book. Let us define the operator D t by D t X=X+f2(t)X. This 
is sometimes called the “total derivative”. At any rate the equations of 
motion in the inertial frame is of the form = f(x,x). In the moving 
frame this becomes an equation of the form 

m4(0(<)(X + ft(i)X)) =f(0(t)X, 0(i)(X + fi(t)X)) 
at 


and in turn 

0(i)4(X + n(i)X) + 6(i)(X+n(i)X) = m _1 f(0(t)X,0(t)(X+fi(t)X)). 
dt 

Now recall the definition of D t we get 


0(t)(^-D t X + S2(t)D t X ) = m _1 f (O(i)X, O(i)V) 
dt 


and finally 


mfi(X = F(X, V) 


(26.6) 


where we have defined the relative velocity V = X+f2(f)X and F(X, V) 
is by definition the transformed force f(0(t)X, O(f)V). The equation we 
have derived would look the same in any moving frame: It is a covariant 
expression. 


5. Rigid Body We will use this example to demonstrate how to work with the rotation 
group and it’s Lie algebra. The advantage of this approach is that it gen- 
eralizes to motions in other Lie groups and their algebra’s. Let us denote 
the group of orthogonal matrices of determinant one by SO( 3). This is the 
rotation group. If the Lagrangian of a particle as in the last example is in- 
variant under actions of the orthogonal group so that L(x, x) = L(Qx, Qx) 
for Q €. SO( 3) then the quantity t = x x mx is constant for the motion 
of the particle x = x(f) satisfying the equations of motion in the inertial 
frame. The matrix group SO(3) is an example of a Lie group which we 
study intensively in later chapters. Associated with every Lie group is its 
Lie algebra which in this case is the set of all anti-symmetric 3x3 matrices 
denoted so (3). There is an interesting correspondence between and R 3 
given by 

( 0 -lo 3 u) 2 \ 

lo 3 0 -u > i (u>i,U>2,U>3) = v 

-UJ2 Wl 0 J 
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Furthermore if we define the bracket for matrices A and B in so (3) by 
[A, B] = AB — BA then under the above correspondence [A, B] corre- 
sponds to the cross product. Let us make the temporary convention that 
if x is an element of R 3 then the corresponding matrix in so (3) will be 
denoted by using the same letter but a new font while lower case refers to 
the inertial frame and upper to the moving frame: 

x ^ x € so(3) and 
X X e so(3) etc. 


R 3 so(3) 
Inertial frame x ^ x 
Moving frame X ^ X 


Then we have the following chart showing how various operations match 
up: 


x=OX 


x= OXO 4 


vi x v 2 

V = X 

V =AX=X+ft(f)X 
I = x x mx 


£= OL 

D t L = L+fi(t) x L 


[vi , v 2 ] 
v=x 


V=AX = X+[0(t),X] 

I = [X,TOX] 

I = OLO 4 = [V,f 2(t)] 
AL = L+[fi(t),L] 


and so on. Some of the quantities are actually defined by their position 
in this chart. In any case, let us differentiate 1 = x x mx and use the 
equations of motion to get 


dl 

— = x x mx 
dt 

= x x mx + 0 

= x x f. 

But we have seen that if the Lagrangian (and hence the force f) is invariant 
under rotations that 37 = 0 along any solution curve. Let us examine this 
case. We have 37 = 0 and in the moving frame Z? t L=L+fI(£)L. Transfer- 
ring the equations over to our so(3) representation we have D t L=L+[fl(f), L] 
0. Now if our particle is rigidly attached to the rotating frame, that is, if 
x = 0 then X = 0 and V =[fi(f), X] so 

L = m[X, [fi(£),X]]. 

In Lie algebra theory the map v 1 — > [x, v] = — [v, x] is denoted ad(x) and is 
linear. With this notation the above becomes 


L = — mad(X)fi(f). 
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The map I : X i— > — ?nad(X)fi(t) = /(X) is called the momentum operator. 
Suppose now that we have k particles of masses mi, m 2 , ...m2 each at 
rigidly attached to the rotating frame and each giving quantities x^, X; etc. 
Then to total angular momentum is 22 f (X,;) . Now if we have a continuum 
of mass with mass density p in a moving region B t (a rigid body) then 
letting X u (t) denote path in so(3) of the point of initially at u £B 0 G R 3 
then we can integrate to get the total angular momentum at time t; 

L = - f ad(X u (f))fl(f)dp(u) 

J B 

which is a conserved quantity. 

26.4.5 The Basic Ideas of Relativity 

We will draw an analogy with the geometry of the Euclidean plane. Recall that 
the abstract plane V is not the same thing as coordinate space R 2 but rather 
there are many “good” bijections T : V — » R 2 called coordinatizations such that 
points p € V corresponding under T to coordinates (x(p),y(p)) are temporarily 
identified with the pair (x(p),y(p)) is such a way that the distance between 
points is given by dist(p, q) = \J ( x(jp ) — x(q )) 2 + ( y(p ) — y(y )) 2 or 

d 2 = Ax 2 + Ay 2 

for short. Now the numbers Aai and Ay separately have no absolute meaning 
since a different good-coordinatization <f> : V — > R 2 would give something like 
(X(p),Y(p)) and then for the same two points p, q we expect that in general 
Ax 7 ^ AX and Ay 7 ^ AY. On the other hand, the notion of distance is a 
geometric reality that should be independent of coordinate choices and so we 
always have Aa ,2 + Ay 2 = AX 2 + AF 2 . But what is a “good coordinatization”? 
Well, one thing can be said for sure and that is if x, y are good then X, Y will 
be good also iff 


/ X \ _ ( cos0 .r.-sinfl \ ( x \ f Ti \ 

V Y ) ~ \ =Fsin0 cos 9 ) \ y ) + \ T 2 ) 

for some 9 and numbers T\,T 2 . The set of all such transformations form a 
group under composition is called the Euclidean motion group. Now the idea 
of points on an abstract plane is easy to imagine but it is really just a “set” 
of objects with some logical relations; an idealization of certain aspects of our 
experience. Similarly, we now encourage the reader to make the following ide- 
alization. Imagine the set of all ideal local events or possible events as a sort 
of 4-dimensional plane. Imagine that when we speak of an event happening 
at location ( x , y, z) in rectangular coordinates and at time t it is only because 
we have imposed some sort of coordinate system on the set of events that is 
implicit in our norms regarding measuring procedures etc. What if some other 
system were used to describe the same set of events, say, two explosions el and 
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e2. You would not be surprised to find out that the spatial separations for the 
two events 

AY, AY, A Z 

would not be absolute and would not individually equal the numbers 


Ay, Ay, Ay. 


But how about AT and At. Is the time separation, in fixed units of seconds 
say, a real thing? 

The answer is actually NO according to the special theory of relativity. 
In fact, not even the quantities AY 2 + AY 2 + A Z 2 will agree with Ax 2 + 
Ay 2 + Ay 2 under certain circumstances! Namely, if two observers are moving 
relative to each other at constant speed, there will be objectively unresolvable 
disagreements. Who is right? There simply is no fact of the matter. The 
objective or absolute quantity is rather 

At 2 - Ax 2 - Ay 2 - Ay 2 


which always equals AT 2 — AX 2 — AY 2 — AY 2 for good coordinates systems. 
But what is a good coordinate system? It is one in which the equations of 
physics take on their simplest form. Find one, and then all others are related 
by the Poincare group of linear transformations given by 


( x ) 


/ x ) 


( Xo \ 

Y 

= A 

y 

+ 

yo 

z 


z 


Zo 

V T ) 




V *0 / 


where the matrix A is a member of the Lorentz group. The Lorentz group is 
characterized as that set 0(1,3) of matrices A such that 


' 1 

0 

0 

0 


' 1 

0 

0 

0 

0 

-1 

0 

0 

A = 

0 

-1 

0 

0 

0 

0 

-1 

0 

0 

0 

-1 

0 

0 

0 

0 

-1 


0 

0 

0 

-1 


This is exactly what makes the following true. 


Fact If (t,x) and (T, X) are related by (f,x) 4 

I 2 


0(1, 3) then t 2 — 


= T 2 - 


X 


A(T,±y + (to, Sc o y for A G 


A vector quantity described relative to an inertial coordinates (t, x) by a 
4-tuple v = (v° , v 1 , v 2 , v 3 ) such that its description relative to (T,X) as above 
is given by 

V 4 = Am 1 (contravariant). 

Notice that we are using superscripts to index the components of a vector (and 
are not to be confused with exponents). This due to the following convention: 
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vectors written with components up are called contravariant vectors while those 
with indices down are called covariant. Contravariant and covariant vectors 
transform differently and in such a way that the contraction of a contravariant 
with a covariant vector produces a quantity that is the same in any inertial 
coordinate system. To change a contravariant vector to its associated covariant 
form one uses the matrix A introduced above which is called the Lorentz metric 
tensor. Thus (v°, v 1 , v 2 , v 3 )A = (u , — v 1 , — v 2 , — v 3 ) := (uo, Ui, U 2 , U 3 ) and thus 
the pseudo-length ViV ' 1 = ( v °) 2 — (u 1 ) 2 — (v 2 ) 2 — (u 3 ) 2 is an invariant with respect 
to coordinate changes via the Lorentz group or even the Poincare group. Notice 
also that Vi v l actually means which is in turn the same thing as 

The so called Einstein summation convention say that when an index is 
repeated once up and once down as in UjU®, then the summation is implied. 

Minkowski Space 

One can see from the above that lurking in the background is an inner product 
space structure: If we fix the origin of space time then we have a vector space 
with inner product (v,v) = 1 \v l = (i >°) 2 — (u 1 ) 2 — (v 2 ) 2 — (u 3 ) 2 . This inner 
product space (indefinite inner product!) is called Minkowski space. The inner 
product just defined is the called the Minkowski metric or the Lorentz 
metric.. 

Definition 26.26 A 4-vector v is called space-like iff (v,v) < 0, time-like iff 
(v, v) > 0 and light-like iff (v, v) = 0. The set of all light-like vectors at a point 
in Minkowski space form a double cone in R 4 referred to as the light cone. 

Remark 26.5 (Warning) Sometimes the definition of the Lorentz metric given 
is opposite in sign from the one we use here. Both choices of sign are popular. 
One consequence of the other choice is that time-like vectors become those for 
which (v, v) < 0. 

Definition 26.27 At each point of 1 f! I 4 there is a set of vectors parallel to 
the 4~ axe s of R 4 . We will denote these by do, 81 , 82 , and 83 (suppressing the 
point at which they are based). 

Definition 26.28 A vector v based at a point in R 4 such that (do, v) > 0 will be 
called future pointing and the set of all such forms the interior of the “ future ” 
light-cone. 

One example of a 4-vector is the momentum 4-vector written p = (E, p ) 
which we will define below. We describe the motion of a particle by referring 
to its career in space-time. This is called its world-line and if we write c for 
the speed of light and use coordinates (x°, x 1 , x 2 , x 3 ) = ( ct , x, y, z) then a world 
line is a curve y(s) = (x°(s), ar(s), x 2 (s), x 3 (s)) for some parameter. The mo- 
mentum 4-vector is then p = tocu where u is the unite vector in the direction 
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LIGHTLIKE TIMELIKE 



of the 4- velocity 7 '(s). The action functional for a free particle in Relativistic 
mechanics is invariant with respect to Lorentz transformations described above. 
In the case of a free particle of mass m it is 


Al 



me <700,7 (s)) 1/2 ds. 


The quantity c is a constant equal to the speed of light in any inertial coordinate 
system. Here we see the need to assume that (7(s),7(s)) > 0 (7(s) is timelike) 
and then there is an obvious analogy with the length of a curve. Define 


t(s)=[ (j(s),j(s)) 1/2 ds 

J Si 


Then £r(s) = cm(j(s), j{s)) 1/2 > 0 so we can reparameterize by r: 


7(t) = (z°(t),2: 1 (t),2: 2 (t),2: 3 (t)). 

The parameter r is called the proper time of the particle in question. A sta- 
tionary curve will be a geodesic or straight line in R 4 . 

Let us return to the Lorentz group. The group of rotations generated by ro- 
tations around the spatial axes x,y,z are a copy of £0(3) sitting inside £0(1, 3) 
and consists of matrices of the form 


1 0 

0 R 
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where R € 50(3). Now a rotation of the x,y plane about the Zjt-plane 1 for 
example has the form 


/ cT \ 


' 1 0 0 o' 


( d \ 

X 


0 cos (9) sin(0) 0 


X 

y 


0 sin(0) cos (9 0 


y 

v ^ ) 


0 0 0 1 


\ * / 


where as a Lorentz “rotation” of the t,x plane about the y,z-plane has the form 


( cT \ 


X 


Y 


V ^ ) 



cosh(/3) 

sinh(/3) 

0 

0 

( ct 



sinh(/3) 

cosh(/3) 

0 

0 

X 

1 


0 

0 

1 

0 

y 



0 

0 

0 

1 

\ * 

) 



= [cT, X, Y, Z] = [(cosh (3) ct + (sinh (3) x, (sinh (3) ct + (cosh (3) x, y, z ] Here the 
parameter f3 is usually taken to be the real number given by 


tanh (3 = v/c 


where v is a velocity indicating that in the new coordinates the observer in 
travelling at a velocity of magnitude v in the x-direction as compared to an ob- 
server in the original before the transformation. Indeed we have for an observer 
motionless at the spatial origin the T, X , Y. Z system the observers path is given 
in T, X , Y, Z coordinates as T i— > (T, 0, 0, 0) 


dX _ 
dt 

dX _ 
dx 

v = 


c sinh [3 

cosh / 3 

dx sinh (3 

dt cosh j3 


c tanh f3. 


Calculating similarly, and using j = 0 we are lead to ^ = ^| = 0. So 

the t, x, y, z observer sees the other observer (and hence his frame) as moving 
in the x- direction at speed v = c tanh [3. The transformation above is called a 
Lorentz boost in the x-direction. 

Now as a curve parameterized by the parameter r (the proper time) the 
4-momentum is the vector 

d 

p = mc—x r . 
dr 

In a specific inertial (Lorentz) frame 

. . dt d , , / me 2 mex mey mez 

W dr dt W \cy/l-{v/c ) 2 y/1 -(v/c ) 2 y/1 ~ (v / c ) 2 y/l ~ {v / c) 2 

x It is the z,t-plane rather than just the z-axis since we are in four dimensions and both the 
z-axis and the t-axis would remain fixed. 
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which we abbreviate to p (t) = (E/c,~p) where p? is the 3- vector given by the 
last there components above. Notice that 

mV = (p(f), p (f)} = E 2 /c 2 + | (p | 2 

is an invariant quantity but the pieces E 2 /c 2 and | p |“ are dependent on the 
choice of inertial frame. 

What is the energy of a moving particle (or tiny observer?) in this theory? 
We claim it is the quantity E just introduced. Well, if the Lagrangian is any 
guide we should have from the point of view of inertial coordinates and for a 
particle moving at speed v in the positive x— direction 

ar f) / \ 

E = v— = v— y—mc 2 \/ 1 — {v/c) 2 J — (— m<?sj\ — ( v/c ) 2 ) 
c 3 c 2 

— 777 , - — iyi 

vV - v 2 ) V(l- (v/c) 2 )' 

Expanding in powers of the dimensionless quantity v/c we have E = me 2 + 

-1 mv 2 + O {{v/c) 4 ). Now the term ^mv 2 is just the nonrelativistic expression 
for kinetic energy. What about the me 2 ? If we take the Lagrangian approach 
seriously, this must be included as some sort of energy. Now if v had been zero 
then we would still have a “rest energy” of me 2 ! This is interpreted as the 
energy possessed by the particle by virtue of its mass. A sort of energy of being 
as it were. Thus we have the famous equation for the equivalence of mass and 
energy (we have v = 0 here): 

E = me 2 . 

If the particle is moving then me 2 is only part of the energy but we can define 
Eq = me 2 as the “rest energy” . Notice however, that although the length of the 
momentum 4-vector mx=p is always m; 

(p, p) 1 ^ 2 = m{{dt/dr) 2 — ( dx/dr ) 2 — ( dy/dr ) 2 — ( dz/dr ) 2 } 4 / 2 = m 

and is therefore conserved in the sense of being constant one must be sure to 
remember that the mass of a body consisting of many particles is not the sum 
of the individual particle masses. 

26.4.6 Variational Analysis of Classical Field Theory 

In field theory we study functions <f> : RxR 3 — > R fe _ yy e use variables (j>(x°, x x ,x 2 , x 3 
x, y. z) A Lagrangian density is a function £(<j>, d</>) and then the Lagrangian 
would be 

L((j>,d^i)= f C(4>,d(j))d 3 x 

Jv CR 3 

and the action is 

S= f f d(j))d 3 xdt = f C{cj),d(j))d A x 

J Jv cR 3 Jvxi CR 4 
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What has happened is that the index i is replaced by the space variable x = 
(a; 1 , a; 2 , a; 3 ) and we have the following translation 


i > — >> — ►> — > x 


q >— > 

q l >— »>—»>— * 

q\{t) ~~~ 

p l (t) >->>—>->-> 

L(q,p) 

S = fL(q,q)dt 


<t> 

(j){t,x) = (f>(x) 

d t <t>(t, x) + x) = d(j>( x) 

/vcr 3 C d&'Px 
S = f f C(4>, d(j))d 3 xdt 


where d<$> = {d 0 4>, d\ (j), d 2 <t>, 9 3 </>). So in a way, the mechanics of classical massive 
particles is classical field theory on the space with three points which is the set 
{1, 2, 3}. Or we can view field theory as infinitely many particle systems indexed 
by points of space-i.e. infinite degrees of freedom. 

Actually, we have only set up the formalism of scalar fields and have not, for 
instance, set things up to cover internal degrees of freedom like spin. However, 
we will discuss spin later in this text. Let us look at the formal variational 
calculus of field theory. We let S<j) be a variation which we might later assume to 
vanish on the boundary of some region in space-time U = I xV C R x I 3 = I 4 . 
In general, we have 


6S = 


,dC 

d<j> 


= / da [5(t> 


dC 

d(dv<t>) 


dC 


d 4 a 


drx 




Now the first term would vanish by the divergence theorem if 5(f> vanished on 
the boundary dU. If <f> were a field that were stationary under such variations 
then 


SS = 



(dC dC \ 


d 4 x = 0 


for all S(f> vanishing on dU so we can conclude that Lagrange’s equation holds 
for (j) stationary in this sense and visa versa: 


dC n dC = n 


These are the held equations. 

26.4.7 Symmetry and Noether’s theorem for field theory 

Now an interesting thing happens if the Lagrangian density is invariant under 
some set of transformations. Suppose that S <f> is an infinitesimal “internal” 
symmetry of the Lagrangian density so that SS(S(/)) = 0 even though S(f> does 
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not vanish on the boundary. Then if cj> is already a solution of the field equations 
then 


0 = SS = [ d fl 
Ju 



d A x 


for all regions U. 
we § et 


This means that 


dc 


V = o 


0 so if we define = 


or 


l f =- v r 


where j =(j 1 ,j 2 ,j 3 ) and V • j = div( j ) is the spatial divergence. This looks 
like some sort of conservation.. Indeed, if we define the total charge at any time 
t by 


Q(t) = 


j°d 3 x 


the assuming j shrinks to zero at infinity then the divergence theorem gives 


d 

dt 


Q{t) = 




V • j d 3 x = 0 


so the charge Q(t) is a conserved quantity. Let Q(U,t) denote the total charge 
inside a region U. The charge inside any region U can only change via a flux 
through the boundary: 

= [ j • n dS 

JdU 

which is a kind of “local conservation law” . To be honest the above discussion 
only takes into account so called internal symmetries. An example of an internal 
symmetry is given by considering a curve of linear transformations of R fc given 
as matrices C(s) with C(0) = I. Then we vary <f> by C{s)4> so that 8<j> = 
^ | C(s)(f> = C'(0)(j>. Another possibility is to vary the underlying space so that 
C(s, .) is now a curve of transformations of R 4 so that if <j> s (x) = <j>(C(s,x)) is 
a variation of fields then we must take into account the fact that the domain of 
integration is also varying: 


L(<j> a ,d(f> a ) = [ C((/) s ,d(l) s )d 4 x 

J t/ s CM 4 


We will make sense of this later. 
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26.4.8 Electricity and Magnetism 

Up until now it has been mysterious how any object of matter could influence 
any other. It turns out that most of the forces we experience as middle sized 
objects pushing and pulling on each other is due to a single electromagnetic 
force. Without the help of special relativity there appears to be two forces; 
electric and magnetic. Elementary particles that carry electric charges such 
as electrons or protons, exert forces on each other by means of a field. In 
a particular Lorentz frame, the electromagnetic field is described by a skew- 
symmetric matrix of functions called the electromagnetic field tensor: 


0 

E x 

Ey 

E z 

—E x 

0 

-B z 

By 

— Ey 

B z 

0 

—B. 

-E z 

— By 

B x 

0 


Where we also have the forms F = A sv Fy S and F = A sl ‘Ff . This tensor 

can be derived from a potential A = (A 0 , Ai,A 2 ,A 3 ) by F ltl/ = The 

contravariant form of the potential is ( A 0 , —A\, —A 2 , — A 3 ) is a four vector often 
written as 

A = (<f>, A). 

The action for a charged particle in an electromagnetic field is written in terms 
of A in a manifestly invariant way as 


—mcdr A u dx M 

c 


so writing A = {<j>, A) we have 


r — > //v 

S= / {—me— e<f)(t) + A • — )dt 

/ f-. L LL (JjL 


so in a given frame the Lagrangian is 

T(x,^,f) = —mc 2 \/l - { v/c ) 2 - e(j>{t) + A • 

Remark 26.6 The system under study is that of a particle in a field and does 
not describe the dynamics of the field itself. For that we would need more terms 
in the Lagrangian. 


This is a time dependent Lagrangian because of the </>{t) term but it turns 
out that one can re-choose A so that the new <j>(t) is zero and yet still have 
F llv = This is called change of gauge. Unfortunately, if we wish 

to express things in such a way that a constant field is given by a constant 
potential then we cannot make this choice. In any case, we have 


T dx , , n 7 TT7 t , -r> dx 

L{ x = — me 2 v 1 — {v/c) 2 — e<f> + A ■ 

dt 


dt 
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and setting and ~v =owe get the follow form for energy 


(9 r , _ _ » . TOC 2 

v 'b=^ L ( X ’ v ’*) _ L ( X ’ v >*) = 7 , s 9 

ov y^l - c/c 2 


e^. 


Now this is not constant with respect to time because is not identically zero. 
On the other hand, this make sense from another point of view; the particle is 
interacting with the field and may be picking up energy from the field. 

The Euler-Lagrange equations of motion turn out to be 


dp 

dt 


eE -I — V x B 
c 


where E = — — grad f> and B = curl A. are the electric and magnetic parts 

of the field respectively. This decomposition into electric and magnetic parts 
is an artifact of the choice of inertial frame and may be different in a different 

frame. Now the momentum p is , m v = but a speeds v « c this becomes 

V 1- 0/c) 2 

nearly equal to tov so the equations of motion of a charged particle reduce to 

dxv - e _> - 

to—— = eE H — v x B. 
dt c 

Notice that is the particle is not moving, or if it is moving parallel the magnetic 
field B then the second term on the right vanishes. 


The electromagnetic field equations. 

We have defined the 3- vectors E = — — grad <f> and B = curl A but since 

the curl of a gradient is zero it is easy to see that curl E = — 1 4^ . Also, from 
B = curl A we get divB = 0. This easily derived pair of equations is the first 
two of the four famous Maxwell’s equations. Later we will see that the electro- 
magnetic field tensor is really a differential 2-form F and these two equations 
reduce to the statement that the (exterior) derivative of F is zero: 

dF = 0 


Exercise 26.1 Apply Gauss’s theorem and stokes theorem to the first two Maxwell’s 
equations to get the integral forms. What do these equations say physically? 


One thing to notice is that these two equations do not determine J^E. 

Now we have not really written down a action or Lagrangian that includes 
terms that represent the field itself. When that part of the action is added in 
we get 

S = [ (-mc—-An ( ^—)dT + a f F u ^F V udx A 
Ja c ^ dr J v 


where in so called Gaussian system of units the constant a turns out to be . 
Now in a particular Lorentz frame and recalling ?? we get = a f v F l '^F l , f j j dx‘ l = 


8tt 



B 


dtdxdydz. 
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In order to get a better picture in mind let us now assume that there is a 
continuum of charged particle moving through space and that volume density 
of charge at any given moment in space-time is p so that if dxdydz = dV 
then pdV is the charge in the volume dV . Now we introduce the four vector 
pu = p(dx/dr) where u is the velocity 4- vector of the charge at ( t,x, y,z ). Now 
recall that pdx/dr = ^ {p , pAr) = jjj{p, j) = j. Here j =pAr is the charge current 
density as viewed in the given frame a vector field varying smoothly from point 
to point. Write j = (j ° , j 1 , j 2 , j 3 ). 

Assuming now that the particle motion is determined and replacing the 
discrete charge e be the density we have applying the variational principle with 
the region U = [a, b] x V says 

0 = ~ 6 ^f f dVA i d ^ dT + a J F^F^dx^j 

= - S (If j*' 4 m + aF^F^dxf 


Now the Euler-Lagrange equations become 

dC dC 

dA„ >J, d(d fJ A l/ ) 

where £(A fx , d^A^) = and F^ v = • If one is careful 

to remember that d il A v = is to be treated as an independent variable one 
cane arrive at some complicated looking equations and then looking at the 
matrix ?? we can convert the equations into statements about the fields E, 
B,and (p,j). We will not carry this out since we later discover a much more 
efficient formalism for dealing with the electromagnetic held. Namely, we will 
use differential forms and the Hodge star operator. At any rate the last two of 
Maxwell’s equations read 

curl B = 0 
div E = Airp. 

26.4.9 Quantum Mechanics 

26.5 E. Calculus on Banach Spaces 

Mathematics is not only real, but it is the only reality. That is that 
the entire universe is made of matter is obvious. And matter is made of 
particles. It’s made of electrons and neutrons and protons. So the entire 
universe is made out of particles. Now what are the particles made out 
of? They’re not made out of anything. The only thing you can say about 
the reality of an electron is to cite its mathematical properties. So there’s 
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a sense in which matter has completely dissolved and what is left is just 
a mathematical structure. 

Gardner on Gardner: JPBM Communications Award Presentation. 
Focus-The Newsletter of the Mathematical Association of America v. 14, 
no. 6, December 1994. 


26.6 Categories 

We hope the reader has at least some familiarity with the notion of a category. 
We will not use category theory in any deep way except perhaps when discussing 
homological algebra. Nevertheless the language of categories is extremely con- 
venient and so we will review a few of the salient features. It is not necessary 
for the reader to be completely at home with category theory before going fur- 
ther into the book and so he or she should not be discouraged if this section 
seems overly abstract. In particular, physics students may not be used to this 
kind of abstraction and should simply try to slowly get used to the language of 
categories. 

Definition 26.29 A category £ is a collection of objects Obj(C) = {X, Y, Z , ...} 
and for every pair of objects X,Y a set Honi£(X, Y) called the set of mor- 
phisms from X to Y. The family of all such morphisms will be denoted Mor(C). 
In addition a category is required to have a composition law which is a map 
o : Hom£(X, Y) x Homc(Y, Z) — » Hom<r(X, Z) such that for every three objects 
X,Y,Z £ Obj(€) the following axioms hold: 


Axiom 26.1 (Catl) Hom^X, Y) andEom^(Z,W) are disjoint unless X = Z 
and Y = W in which case Hom<r(X, Y) = Horn c(Z,W). 

Axiom 26.2 (Cat2) The composition law is associative: fo(goh) = (fog)oh. 

Axiom 26.3 (Cat3) Each set of morphisms of the form Hom£(X, X) must 
contain an identity element idy such that, f oidx = / for any f £ Homg(X, Y) 
(and any Y), and idy °f = f for any f £ Hom<r(Y, X). 

Notation 26.1 A morphism is sometimes written using an arrow. For exam- 
ple, if f £ Home(X, Y) we would indicate this by writing f : X — > Y or also 

by X — > Y . Also, the set Hom<r(X, Y) of all morphisms from X to Y is also 
denoted by Morc(X,Y) or abbreviated to Hom(X, Y). 

The notion of category is typified by the case where the objects are sets and 
the morphisms are maps between the sets. In fact, subject to putting restrictions 
on the sets and the maps, this will be almost the only type of category we shall 
need. On the other hand there are plenty of categories of this type: 

1. Grp: The objects are groups and the morphisms are group homomor- 
phisms. 
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2. Rng : The objects are rings and the morphisms are ring homomorphisms. 

3. Lin : The objects are vector spaces and the morphisms are linear maps. 

4. Top: The objects are topological spaces and the morphisms are continuous 
maps. 

5. Man’’: The category of C' r — differentiable manifolds (and C r — maps): 
One of the main categories discussed in this book. This is also called the 
smooth or differentiable category especially when r = oo. 

Notation 26.2 If for some morphisms fi : Xi — » Yi , (i = 1,2), gx ■ X\ — > X 2 
and gy '■ Yi — > Y 2 we have gy o f x = f 2 o g x then we express this by saying that 
the following diagram “commutes'’: 

fi 

Xi - Yi 

gx I I 9y 

x 2 - y 2 
h 

Similarly, if ho f = g we say that the diagram 

f 

X -> Y 
\ I h 
g z 

commutes. More generally, tracing out a path of arrows in a diagram corre- 
sponds to composition of morphisms and to say that such a diagram commutes 
is to say that the compositions arising from two paths of arrows which begin and 
end at the same object are equal. 

Definition 26.30 Suppose that f : X — > Y is a morphism from some category 
£. If f has the property that for any two (parallel) morphisms g±, g 2 : Z — > X we 
always have that fog 1 = fog 2 implies g 1 = 52 , be. if f is “left cancellable” , then 
we call f a €-monomorphism. Similarly, if f : X —> Y is “right cancellable” 
we call f a (t-epimorphism. A morphism that is both a monomorphism and 
an epimorphism is called an isomorphism (or €- isomorphism if the category 
needs to be specified). 

In some cases we will use other terminology. For example, an isomorphism in 
the differentiable category is called a diffeomorphism. In the linear category, 
we speak of linear maps and linear isomorphisms. Morphisms from Hom£(X, X) 
are also called endomorphisms and so we also write Ende(X) := Homu;(X, X). 
The set of all ^-isomorphisms in Homc(X, X) is sometimes denoted by Aut<r(X) 
and are called automorphisms. 
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We single out the following: In many categories like the above we can form 
a sort of derived category that uses the notion of pointed space and pointed 
map. For example, in the topological category a pointed topological space is 
an topological space X together with a distinguished point p. Thus a typical 
object would be written (X,p). A morphism / : (X,p) —> (W,q) would be a 
continuous map such that f(p) = q. 

A functor F is a pair of maps both denoted by the same letter F which 
map objects and morphisms from one category to those of another 

F '■ Ob(£ i) Ob{€ 2 ) 

F : Mor{<L\) — > Mor(€ 2 ) 

such that composition and identity morphisms are respected: If / : X — > Y 
then 

F(f) : F(X) -4 F(Y) 

is a morphism in the second category and we must have 

1. F(id Cl ) = id e2 

2. If / : X -4 Y and g : Y -+ Z then F(f) : F(X) -4 F(Y), F{g) : F(Y) -4 
F{Z) and finally 

F{g°f) = F(g)°F(f). 

This last property is perhaps the salient feature of a functor. 

Example 26.7 Let Linn be the category whose objects are real vector spaces 
and whose morphisms are real linear maps. Similarly, let Line be the category 
of complex vector spaces with complex linear maps. To each real vector space 
V we can associate the complex vector space C <S>r V called the complexifi cation 
of V and to each linear map of real vector spaces l : V -4 W we associate the 
complex extension tc '■ C ®r V— > C <S>r W. Here, C <S>r V is easily thought of as 
the vector space V where now complex scalars are allowed. Elements of C®r V 
are generated by elements of the form c ® v where c £ C, vG V and we have 
i(c®v) = ic®v where i = \J— 1. The map lc '■ C ®r V— > C ®r W is defined by 
the requirement fe( c ® v) = c ® tv. Now the assignments 


t 44 t c 
V I — 4 (C 0R V 

define a functor from LinR to Line- 

Remark 26.7 In practice, complexification amounts to simply allowing com- 
plex scalars. For instance, we might just write cv instead of c <S> v. 

Actually we have defined what is called a covariant functor. A con- 
travariant functor is defined similarly except that the order of composition 
is reversed and instead of Funct2 above we would have F(g ° /) = F(f) ° F(g). 
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An example of a functor contravariant is the dual vector space functor which is 
a functor from the category of vector spaces LinR to itself and sends each space 
to its dual and each linear map to its dual (adjoint): 

V 4 W 

I . I 
v* 4 w* 

Notice the arrow reversal. 

Remark 26.8 One of the most important functors for our purposes is the tan- 
gent functor defined in section 3. Roughly speaking this functor replaces dif- 
ferentiable maps and spaces by their linear parts. 

Example 26.8 Consider the category of real vector spaces and linear maps. 
To every vector space V we can associate the dual of the dual V** . This is a 
covariant functor which is the composition of the dual functor with itself: 


V 

W* 

y** 

A | 1 ► 

A* l 

^ A** | 

W 

V* 

w** 


Now suppose we have two functors 

Fi ■■ Ob(£ i) -► Ob(£ 2 ) 

F i : Mor(<t i) — > Mor(£ 2) 

and 


Fi ■ Ob{<Z 1 ) -► Ob(£ 2 ) 

F 2 : Mor{<L 1 ) — > Mor(€. 2 ) 

A natural transformation T from Fi to Fi is an assignment to each object 
X of £1 a morphism T(X) : F i(AT) — » F i{X) such that for every morphism 
/ : X — > Y of £1 we have that the following diagram commutes: 

T{X) 

Fi(X) -► F 2 PO 

Fi(f) l IF 2 (/) 

FiOO ^ Fi{Y) 

T(Y) 

A common first example is the natural transformation t between the identity 
functor I : LinR — > LinR and the double dual functor t : LinR — >■ LinR: 


r(V) 


V 

y** 

1 

1 /** 

w 

w** 

t (W) 
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Here V** is the dual space of the dual space V*. The map V — > V** sends a 
vector to a linear function v : V* — » R defined by v(a) := a(v) (the hunter 
becomes the hunted so to speak). If there is an inverse natural transformation 
T -1 in the obvious sense, then we say that T is a natural isomorphism and for 
any object X G we say that F i(X) is naturally isomorphic to F 2 ^) • The 
natural transformation just defined is easily checked to have an inverse so is a 
natural isomorphism. The point here is not just that V is isomorphic to V** in 
the category Linn but that the isomorphism exhibited is natural. It works for 
all the spaces V in a uniform way that involves no special choices. This is to 
be contrasted with the fact that V is isomorphic to V* but the construction of 
such an isomorphism involve an arbitrary choice of a basis. 


26.7 Differentiability 

For simplicity and definiteness all Banach spaces in this section will be real 
Banach spaces. First, the reader will recall that a linear map on a normed 
space, say A : Vi — ► V 2 , is bounded if and only it is continuous at one and 
therefore any point in Vi. Given two Banach spaces Vi and V 2 with norms H.^ 
and ||.|| 2 we can form a Banach space from the Cartesian product Vi x V 2 by 
using the norm ||(v, u)|| := maxHlvj^ , ||u|| 2 }. There are many equivalent norms 
for Vi x V 2 including 



Recall that two norms on V, say H-lfand ||.|| ,/ are equivalent if there exist positive 
constants c and C such that 


cm' < nxir<ciixir 

for all x £ V. Also, if V is a Banach space and Wi and W 2 are closed subspaces 
such that \AAnW 2 = {0} and such that every v £ V can be written uniquely in 
the form v = wq+wq where wi eWi and W 2 eW 2 then we write V =Wi©W 2 . In 
this case there is the natural continuous linear isomorphism W 1 XW 2 =Wi©W 2 
given by 

(wi, w 2 ) < — » Wi + w 2 . 

When it is convenient, we can identify \Ni®\N- 2 with W 1 xW 2 . Under the 
representation (wi,w 2 ) we need to specify what norm we are using and there 
is more than one natural choice. We take ||(wi, w 2 )| := max{||wi|| , ||w 2 ||} but 

equivalent norms include, for example, ||(wi,w 2 )|| 2 := \J ||wi ||" + || w 2 1| 2 which 
is a good choice if the spaces happen to be Hilbert spaces. 

Let E be a Banach space and Wc E a closed subspace. We say that W is 
complemented or a split subspace of E if there is a closed subspace W' 
such that E =W©W'. 
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Definition 26.31 (Notation) We will denote the set of all continuous (bounded) 
linear maps from a Banach space E to a Banach space F by L(E. F). The set of 
all continuous linear isomorphisms from E onto F will be denoted by GL(E, F). 

In case, E = F the corresponding spaces will be denoted by fll(E) and GL(E). 

Here GL(E) is a group under composition and is called the general linear 
group . In particular, GL(E, F) is a subset of L{ E, F) but not a linear subspace. 

Definition 26.32 Let V. u i = 1, ..., k and W be Banach spaces. A map p :V 1 
x • • • x \4 - W is called multilinear (k -multilinear) if for each i, 1 < i < k 
and each fixed (wi, w), ..., Wfc) £ V± x • • • x Vi x • • • x \4 we have that the map 

i-th 

obtained by fixing all but the i-th variable, is a bounded linear map. In other 
words, we require that p be R- linear in each slot separately. 

A multilinear map p :Vi x • • • xVt — » W is said to be bounded if and only 
if there is a constant C such that 

||M v i)V2, ---> Vfc) || w < C ||vi|| Ei ||v 2 || E2 - - - ||v fe || Efc 

for all (v!,...,Vfc) £ Ei x • • • x E*,. The set of all bounded multilinear maps 
Ei x • • • x Efc — > W will be denoted by L(Ei, E&; W). If Ei = • • • = E& = E 
then we write L k ( E; W) instead of L(E, ..., E; W) 

In case we are dealing with a be a Hilbert space E,(., .) then we have the 
group of linear isometries 0(E) from E onto itself. That is, 0(E) consists of the 
bijective linear maps $ : E^E such that (<I>v, >!>«/) = (v,w) for all v,w € E. The 
group O(E) is called the orthogonal group (or sometimes the Hilbert group 
in the infinite dimensional case). 

Definition 26.33 A (bounded) multilinear map p : V x • • • x V — > W is called 
symmetric (resp. skew- symmetric or alternating) iff for any Vi, V 2 , ..., v*, £ 

V we have that 


p(v i, V 2 , ..., Vfc) = K(\! al ,V a 2 , -No k) 
resp. p(\/i, v 2 , ..., v fc ) = sgn(a)p(\i al , v a 2, Vok) 

for all permutations a on the letters (1, 2, ...., k}. The set of all bounded sym- 
metric (resp. skew-symmetric) multilinear maps V x ••• x V — > W is denoted 
L k S ymVm (resp. L k skew (y ; W) orL^(V;W)). 

Now the space L(V, W) is a Banach space in its own right with the norm 

11*11 = sup = SU p{||/(v) || w : ||v|| v = 1}. 

vev !|v|| v 

The spaces L( E 1; .... E^; W) are also Banach spaces normed by 

\\p\\ := sup{||/z(vi, v 2 , ..., v fe )|| w : ||vi|| E . = 1 for i= 1 ,..,k} 
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Proposition 26.3 A k-multilinear map p G L(Ei, ..., E/y W) is continuous if 
and only if it is bounded. 

Proof. (^=) We shall simplify by letting k = 2. Let (ai,a 2 ) and (vi,v 2 ) be 
elements of Ei x E 2 and write 


/i(vi,v 2 ) - /z(ai,a 2 ) 

= M v i - ai,v 2 ) + p(a 1 , v 2 - a 2 ). 

We then have 

||m(vi, v 2 ) — ju(ai, a 2 )|| 

< C || V! — a i|| |H| + C || a id || v 2 — a 2 | 
and so if ||(v 1; v 2 ) — (ai,a 2 )|| — > 0 then ||vj — a., || — > 0 and we see that 


||/r(vi,v 2 ) - /z(a 1 ,a 2 )|| -»■ 0. 


(=>) Start out by assuming that p is continuous at (0,0). Then for r > 0 
sufficiently small, (vi,v 2 ) G B{{ 0,0), r) implies that ||/i(v 1 ,v 2 )|| < 1 so if for 
i = 1 , 2 we let 

z, := - — — i for some e > 0 

ll v i|li + e 

then (zj,z 2 ) G i?((0, 0),r) and ||/x(z 1 ,z 2 )|| < 1. The case (v l5 v 2 ) = (0,0) is 
trivial so assume (vi,v 2 ) ^ (0,0). Then we have 


M(zi,z 2 ) = m(. 


rvi 


rv 2 


v i I + e v 2 


l|vi||+e)(||v 2 || + e) 


A«(vi, v 2 ) < 1 


and so p(\/i, v 2 ) < r _2 (||vi| + e)(||v 2 || + e). Now let e — > 0 to get the result. ■ 
We shall need to have several Banach spaces handy for examples. For the 
next example we need some standard notation. 


Notation 26.3 In the context o/M", we often use the so called “multiindex 
notation”. Let a = (ay, ...,a n ) where the at are integers and 0 < a,; < n. Such 
an n-tuple is called a multiindex. Let |a| := a i + ... + a n and 

d a f _ dH/ 

dx° <9(a’ 1 ) ai d(ad) Q2 ---<9(a: 1 ) a W 


Example 26.9 Consider a bounded open subset f 2 o/K”. Let L p k (H) denote the 
Banach space obtained by taking the Banach space completion of the set C k (Ll) 
of k -times continuously differentiable real valued functions on f l with the norm 
given by 


i/p 


ll/ll 


k,p ' 



d a 


Note that in particular Lg(fl) = L p (fl ) is the usual L p — space from real analysis. 
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Exercise 26.2 Show that the map C k (D) — > C k 1 (fi) given by f i— > is 
bounded if we use the norms ||/|| 2 p and ||/|| 2 _i p- Show that we may extend 
this to a bounded map L 2 (fl) — ■> L\{ ft). 

Proposition 26.4 There is a natural linear isomorphism L(V, L(V, W)) = L 2 (V, W) 
given by 

l(v i)(v 2 ) < — » l{vi,v 2 ) 

and we identify the two spaces. In fact, L(V, L(V, L(V, W)) = L 3 (V; W) and in 
general L(V, L(V, L(V, L(V, W)) S L fc (V; W) etc. 

Proof. It is easily checked that if we just define ( l T)(v i)(v 2 ) = T(vi,v 2 ) 
then (, T <-> T does the job for the k = 2 case. The k > 2 case can be done by 
an inductive construction and is left as an exercise. It is also not hard to show 
that the isomorphism is continuous and in fact, norm preserving. ■ 

Definition 26.34 A function f : U C V — > W between Banach spaces and 
defined on an open set U C V is said to be differentiable at p G U iff there is 
a bounded linear map A p £ L(y,\N) such that 

lim /(P + h W(pl-V h = 0 

IIMHo |jh|| 

In anticipation of the following proposition we write A p = Df{ p). We will also 
use the notation Df | or sometimes f{p). The linear map Df{ p) is called the 

derivative of f at p. 

We often write Df | ■ h. The dot in the notation just indicate a linear 
dependence and is not a literal “dot product”. We could also write D/(p)(h). 

Exercise 26.3 Show that the map F : L 2 ( D) — > L 1 { D) given by F(f) = / 2 is 
differentiable at any ff £ L 2 (I2). 

Proposition 26.5 If A p exists for a given function f then it is unique. 

Proof. Suppose that A p and B p both satisfy the requirements of the defi- 
nition. That is the limit in question equals zero. For p + h £U we have 

A p b - B p h = (/( p + h) - /( p) - A p ■ h) 

-(/( P+ h ) -/( P) -B p -h). 

Dividing by ||h|| and taking the limit as ||h|j — * 0 we get 

||A p h-B p h||/||h||— >0 

Now let h 0 be arbitrary and choose e > 0 small enough that p + eh £ U. 
Then we have 

||A p (e h )-B p (eh)||/||eh|| -► 0. 
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But by linearity ||A p (eh) — _B p (eh)|| / || eh || = || ^4 p h — B p h|| / ||h|| which doesn’t 
even depend on e so in fact || A p b — £> p h|| = 0. ■ 

If we are interested in differentiating “in one direction at a time” then we 
may use the natural notion of directional derivative. A map has a directional 
derivative D^f at p in the direction h if the following limit exists: 

(D h /)(p) 

In other words, I?h/(p) = s| i=0 /(p+fh). But a function may have a directional 
derivative 2 in every direction (at some fixed p), that is, for every h £ E and yet 
still not be differentiable at p in the sense of definition 26.34. Do you remember 
the usual example of this from multivariable calculus? 

Definition 26.35 If it happens that a function f is differentiable for all p 
throughout some open set U then we say that f is differentiable on U . We 
then have a map Df : U C V — »L(V, W) given by p i— > Df( p). If this map is dif- 
ferentiable at some p £ V then its derivative at p is denoted DDf(p) = D 2 f{ p) 
or D 2 f | and is an element of L(V, L(V, W)) = L 2 (V;W). Similarly, we may 
inductively define D k f £ L fc (V; W) whenever f is sufficiently nice that the pro- 
cess can continue. 

Definition 26.36 We say that a map f : U C V — > W is C r — differentiable 
on U if D r f | € L r (y , W) exists for all p £ U and if continuous D r f as map 
U — > L r (y, W). If f is C r — differentiable on U for all r > 0 then we say that f 
is C°° or smooth (on U). 

Exercise 26.4 Show directly that a bounded multilinear map is C°° . 

Definition 26.37 A bijection f between open sets U a C V and Up C W is 
called a C r — diffeomorphism iff f and / - l are both C r — differentiable (on U a 
and Up respectively). If r = oo then we simply call f a diffeomorphism. Often, 
we will have W = V in this situation. 

Let U be open in V. A map f : U — > W is called a local C r diffeomorphism 
iff for every p £ U there is an open set U p C U with p £ U p such that. f\ v : 
U p — > / (C/ p ) is a C r — diffeomorphism. 

In the context of undergraduate calculus courses we are used to thinking 
of the derivative of a function at some a € 1 as a number f'(a) which is the 
slope of the tangent line on the graph at (a, f(a)). From the current point of 
view Df(a) = Df | just gives the linear transformation h i— > f (a) ■ h and the 
equation of the tangent line is given by y = f(a) + f'(a)(x — a). This generalizes 
to an arbitrary differentiable map as y = /(a) + Df( a) • (x — a) giving a map 
which is the linear approximation of / at a. 

2 The directional derivative is written as ( Oh / ) ( P J and if / is differentiable at p this is equal 
to Df | p h. The notation D\, f should not be confused with Df | h . 
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We will sometimes think of the derivative of a curve 3 c : / C R. — > E at to G I, 
written c(t o), as a velocity vector and so we are identifying c(to) G L(R, E) with 
Dc| io • 1 G E. Here the number 1 is playing the role of the unit vector in R. 

Let f :U CE— >Fbea map and suppose that we have a splitting E = Ei x E 2 
. We will write /(x, y) for (x, y) G E 3 xE 2 . Now for every (a, b) G E 3 xE 2 the 
partial map / a> : y e- > /( a,y) (resp. /, b : x i— ► /(x, b)) is defined in some 
neighborhood of b (resp. a). We define the partial derivatives when they exist 
by D 2 f{ a, b) = Df a f b) (resp. Z?i/(a, b) = Df^{ a)). These are, of course, linear 
maps. 


Dif(a, b) : Ei — ► F 

Z? 2 /(a,b) : E 2 — ^ F 

The partial derivative can exist even in cases where / might not be differentiable 
in the sense we have defined. The point is that / might be differentiable only 
in certain directions. 

If / has continuous partial derivatives Di/(x, y) : Ej — ■> F near (x, y) G Ei x E 2 
then Df(x, y) exists and is continuous near p . In this case, 

Df(x, y) • (vi,v 2 ) 

= £>i/(x, y) ■ Vi + D 2 f{x, y) • v 2 


26.8 Chain Rule, Product rule and Taylor’s The- 
orem 

Theorem 26.3 (Chain Rule) Let U\ and U 2 be open subsets of Banach spaces 
Ei and E 2 respectively. Suppose we have continuous maps composing as 

U x 4 U 2 ^ E 3 

where E 3 is a third Banach space. If f is differentiable atp and g is differentiable 
a t f(p) then the composition is differentiable at p and D(g o /) = Dg(f( p)) o 
Dg( p). In other words, ifv£ Ei then 

D (g°f)\ p -v= Dg\ f(p) ■ (Df | p - v). 

Furthermore, if f G C r (Ui) and g G C r (U 2 ) then g o / G C r (U\). 

Proof. Let us use the notation Oi(v), 0 2 (v) etc. to mean functions such 
that Oj(v) — » 0 as ||v|| — > 0. Let y = /(p). Since / is differentiable at p we have 
/( p + h) = y + Df | ■ h + ||h|| Oi(h) := y + Ay and since g is differentiable at y 
we have g(y + Ay) = Dg | • (Ay) + |j Ay || 0 2 (Ay). Now Ay — > 0 as h — > 0 and in 

3 We will often use the letter I to denote a generic (usually open) interval in the real line. 
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turn 02(Ay) — » 0 hence 

9°f( P+h) = g(y + Ay) 

= D g l y • ( A y) + II a yII o 2 (Ay) 

= Dg\ y ■ (Df\ p ■ h + |jh|j Oi(h)) + ||h|| 0 3 (h) 

= Dg\ y ■ Df\ p • h + ||h|| Dg\ y • O^h) + ||h|| 0 3 (h) 

= Dg\ y - Df\ p .h+||h||0 4 (h) 

which implies that g o f is differentiable at p with the derivative given by the 
promised formula. 

Now we wish to show that f,g&C r r>l implies that go f g C r also. The 
bilinear map defined by composition comp : L( Ei, E2) x L( E2, E3) — > L(Ei, E3) 
is bounded . Define a map 

m f,g ■ P (Dg(f(p),Df(p)) 

which is defined on U\. Consider the composition comp o m.f, g . Since / and g 
are at least C 1 this composite map is clearly continuous. Now we may proceed 
inductively. Consider the r — th statement: 

composition of C r maps are C r 

Suppose / and g are C r+1 then Df is C r and Dg o f is C r by the inductive 
hypothesis so that is C r . A bounded bilinear functional is C°°. Thus comp 
is C°° and by examining comp o rri we see that the result follows. ■ 

We will often use the following lemma without explicit mention when calcu- 
lating: 

Lemma 26.2 Let f : U CV^W be twice differentiable at x 0 £ U C V then 
the map D v f : x 1— > Df(x) ■ v is differentiable at x 0 and its derivative at x 0 is 
given by 

D(D v /)| Xo .h = D 2 /(x 0 )(h,v). 

Proof. The map D v f : x 1— > Df{x) ■ v is decomposed as the composition 

x ^ Df\ x £ Df\ x ■ v 

where R v : L(V. W) 1— > W is the map ( A , b) 1— > A ■ b. The chain rule gives 

D(D v f)(*o) ■ h = DRffDf\J • D{Df)\ Xo ■ h) 

= DRffDf(x 0 )) • (D 2 /(x 0 ) • h). 

But R v is linear and so DR v (y) = R v for all y. Thus 

D(D v f)\ xg ■ h = R v (D 2 f(xo) ■ h) 

= (-D 2 /( x o) • h) • v = D 2 /(x 0 )(h,v). 
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Theorem 26.4 If f : U C V — » W is twice differentiable on U such that D 2 f 
is continuous, i.e. if f £ C 2 (U ) then D 2 f is symmetric: 

D 2 /(p)(w,v) = D 2 f{p){\i, w). 

More generally, if D k f exists and is continuous then D k f (p) £ ^ m (V; W). 


Proof. Let p £ U and define an affine map A : R 2 — > V by A(s,t) := 
p + sv + t w. By the chain rule we have 

0 (°) = D2 (f oA )( °) ’ ( e i, e 2 ) = D 2 f( p) • (v,w) 

where e l5 e 2 is the standard basis of R 2 . Thus it suffices to prove that 


d 2 (f o A) 
dsdt 


(0) 


d 2 {f o A) 
dtds 


( 0 ). 


In fact, for any l £ V* we have 




(0) 


and so by the Hahn-Banach theorem it suffices to prove that - g°g° (0) = 

9 dtds (0) which is the standard 1-variable version of the theorem which we 
assume known. The result for D k f is proven by induction. ■ 


Theorem 26.5 Let g £ L( Fi, F 2 ; W) be a bilinear map and let f\ : U C E — * Fi 
and f -2 : U C E — > F 2 be differentiable (resp. C r ,r > 1) maps. Then the 
composition £»(/i,/ 2 ) is differentiable (resp. C r ,r > 1 ) on U where £>(/i,/ 2 ) : 
x ^ £»(/i(x), / 2 (x)). Furthermore, 

Dg\ x (/i,/ 2 ) • v =g(Dft | x • v, / 2 (x)) + p(/i(x), T»/ 2 | x • v). 

In particular, if F is an algebra with differentiable product * and fi : U C E — F 
and f -2 : U C E — » F then fi * / 2 is defined as a function and 

D(fi ★ / 2 ) ■ v = (Dfi ■ v) * (/ 2 ) + (Dff ■ v) * (Df 2 ■ v). 

Proof. This is completely similar to the usual proof of the product rule and 
is left as an exercise. ■ 

The proof of this useful lemma is left as an easy exercise. It is actually quite 
often that this little lemma saves the day as it were. 

It will be useful to define an integral for maps from an interval [a, b] into a 
Banach space V. First we define the integral for step functions. A function / on 
an interval [a, b] is a step function if there is a partition a = to < t\ < ■ ■ ■ < 
tk = b such that / is constant, with value say fi, on each subinterval [U, U+i). 
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The set of step functions so defined is a vector space. We define the integral of 
a step function / over [a, b } by 


n, k— 1 k—1 

/ f ■= “ *<)/(*i)- 

b l *= 0 2—0 


One checks that the definition is independent of the partition chosen. Now the 
set of all step functions from [a, b\ into V is a linear subspace of the Banach 
space B(a, b, V) of all bounded functions of [a, 6] into V and the integral is a 
linear map on this space. Recall that the norm on B(a, b, V) is sup a<t<b {/(*)}• 
If we denote the closure of the space of step functions in this Banach space 
by S(a,b, V) then we can extend the definition of the integral to S(a, b, V) by 
continuity since on step functions we have 


<(b-a)\ 


In the limit, this bound persists. This integral is called the Cauchy-Bochner 
integral and is a bounded linear map S(a, b, V) — > V. It is important to notice 
that S(a, b, V) contains the continuous functions C([a, fe],V) because such may 
be uniformly approximated by elements of S(a, b, V) and so we can integrate 
these functions using the Cauchy-Bochner integral. 


Lemma 26.3 If i : V — ► W is a bounded linear map of Banach spaces then for 
any f € S(a, b, V) we have 



i o f = l o 



Proof. This is obvious for step functions. The general result follows by 
taking a limit for step functions converging in S(a, b, V) to /. ■ 


Some facts about maps on finite dimensional spaces. 

Let U be an open subset of K" and let / : U — > R m be a map which is differen- 
tiable at a = (a 1 , ..., a n ) € K™. The map / is given by m functions /* : U — > R m 
, 1 < i < m. Now with respect to the standard bases of M" and respectively, 
the derivative is given by an n x m matrix called the Jacobian matrix: 


Uf) ■■= 


fg(«) fS(«) ■■ 

Sr (a) 

• gi(“) 

!?(<•) 

§£(■» 


The rank of this matrix is called the rank of / at a. If n = m so that / : 
U C R" — > R n then the Jacobian is a square matrix and det(J a (/)) is called 
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the Jacobian determinant at a. If / is differentiable near a then it follows 
from the inverse mapping theorem proved below that if det(J a (/)) 7^ 0 then 
there is some open set containing a on which / has a differentiable inverse. The 
Jacobian of this inverse at f(x) is the inverse of the Jacobian of / at x. 

The following is the mean value theorem: 

Theorem 26.6 Let V and W be Banach spaces. Let c : [a, 6] — > V be a C 1 — map 
with image contained in an open set U C V. Also, let f : U — > W be a C 1 map. 
Then 

- f(c(a)) = [ Df(c(t)) ■ c'(t)dt. 

Jo 

If c(t) = (1 — t)x + ty then 

/( y) - /M = ( Df(c(t))dt • (y - x) 

Notice that D f{c{t))dt £ L(V, W). 

Proof. Use the chain rule and the 1-variable fundamental theorem of cal- 
culus for the first part. For the second use lemma 26.3. ■ 

Corollary 26.1 Let U be a convex open set in a Banach space V and f : U — > W 
a C 1 map into another Banach space W. Then for any x, y £U we have 

ll/(y) - /Mil < Cx, y ||y-x|| 

where C x , y is the supremum over all values taken by f along the line segment 
which is the image of the path 1 1— > (1 — <)x + ty. 

Recall that for a fixed x higher derivatives D p f | are symmetric multilinear 
maps. For the following let ( y) k denote (y,y , . . . , j/) . With this notation we have 

fc-times 

the following version of Taylor’s theorem. 

Theorem 26.7 (Taylor’s theorem) Given Banach spaces M and \N, a C r func- 
tion f : U — > W and a line segment t 1— > (1 — f)x + ty contained in U, we have 
that 1 1— > D p f(x + ty) ■ ( y) p is defined and continuous for 1 < p < k and 

f(x + y) = f(x) + I Df\ x -y+J D 2 f\ x ■ ( y ) 2 + • • • + D k ~ 1 f\ x ■ ( y ) k ^ 

+ Si Dk ^ x + • M kdt 

Proof. The proof is by induction and follows the usual proof closely. The 
point is that we still have an integration by parts formula coming from the 
product rule and we still have the fundamental theorem of calculus. ■ 
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26.9 Local theory of maps 

Inverse Mapping Theorem 

The main reason for restricting our calculus to Banach spaces is that the inverse 
mapping theorem holds for Banach spaces and there is no simple and general 
inverse mapping theory on more general topological vector spaces. 

Definition 26.38 Let E and F be Banach spaces. A map will be called a C r 
diffeomorphism near p if there is some open set U C dom(/) containing p 
such that f\jj:U — > f(U) is a C r diffeomorphism onto an open set f(U). The 
set of all maps which are diffeomorphisms near p will be denoted Diffp(E, F). If 
f is a C r diffeomorphism near p for all p € U = dom(/) then we say that f is 
a local C r diffeomorphism. 

Definition 26.39 Let X,d\ and Y,g ?2 be metric spaces. A map f : X —> Y is 
said to be Lipschitz continuous (with constant k) if there is a k > 0 such that 
d(f(xi),f(x 2 )) < kd{x 1 , 2 : 2 ) for all X\,X 2 G X. If 0 < k < 1 the map is called 
a contraction mapping (with constant k) or is said to be k- contractive. 

The following technical result has numerous applications and uses the idea 
of iterating a map. Warning: For this theorem f n will denote the n— fold 
composition f o f o ■ ■ ■ o f rather than a product. 

Proposition 26.6 (Contraction Mapping Principle) Let F be a closed sub- 


set of a complete metric space (. M,d ). Let f : F — 
such that 

d(f(x)J(y)) < kd(x,y) 

» F be a k-contractive map 

for some fixed 0 < k < 1. Then 

1 ) there is exactly one Xq G F such that f(x 0 ) = Xq. Thus Xq is a fixed point 
for f. Furthermore, 

2) for any y G F the sequence y n := f n (y) converges to the fixed point xq 
with the error estimate d{y n ,x 0 ) < ^zrrd(yi, xq). 

Proof. Let y G F. By iteration 


d(f n (y)J n -\y)) < kd(r-\y),r~ 2 (y )) < 

• • • < k n ~ 1 d(f(y), y) 

as follows: 


d(f n+j+1 (y)J n (y )) < d(r+i +1 (y),r+ffy)) + 

■■■ + d(f n+1 (y),f n (y)) 


<(ki + 1 + --- + k)d(r(y),r- 1 (y)) 

< d{f n {y),f n -\y )) 

k n 

Y^ d (f\y), y )) 
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From this, and the fact that 0 < k < 1, one can conclude that the sequence 
f n (y ) = x n is Cauchy. Thus f n (y) — > Xq for some Xo which is in F since F is 
closed. On the other hand, 

x 0 = lim i f n {y) = lim /(/ n_1 (y)) = f(x 0 ) 

n — »0 n—> 0 

by continuity of /. Thus Xq is a hxed point. If uq where also a hxed point then 
d{xo, uo) = d(f(x 0 )J(u 0 )) < kd(x 0 ,u 0 ) 

which forces xq = uq. The error estimate in (2) of the statement of the theorem 
is left as an easy exercise. ■ 

Remark 26.9 Note that a Lipschitz map f may not satisfy the hypotheses of 
the last theorem even if k < 1 since U is not a complete metric space unless 

U= E. 

Definition 26.40 A continuous map f : U — > E such that Lf := id u~f is 
injective has a not necessarily continuous inverse Gf and the invertible map 
Rf := idE —Gf will be called the resolvent operator for f. 

The resolvent is a term that is usually used in the context of linear maps and 
the definition in that context may vary slightly. Namely, what we have defined 
here would be the resolvent of ±T/. Be that as it may, we have the following 
useful result. 

Theorem 26.8 Let E be a Banach space. If f : E — > E is continuous map that 
is Lipschitz continuous with constant k where 0 < k < 1, then the resolvent Rf 
exists and is Lipschitz continuous with constant jiff . 

Proof. Consider the equation x — f(x) = y. We claim that for any y £ E 
this equation has a unique solution. This follows because the map F : E — > E 
defined by F(x) = f(x) + y is /c-contractive on the complete normed space E as 
a result of the hypotheses. Thus by the contraction mapping principle there is 
a unique x fixed by F which means a unique x such that f{x) + y = x. Thus 
the inverse Gf exists and is defined on all of E. Let Rf := idg —Gf and choose 
3 / 1 , j /2 G E and corresponding unique Xi , i = 1,2 with ay — /( xf) = y,;. We have 

\\R f {yi) - Rf(y 2 ) || = II f{xi) - f(x 2 ) || 

<k\\xi-x 2 \\ < 

< k 1 1 j/i - Rf{yi) - ( 2/2 - RfinM < 

< k H 2/1 - 2 / 2 1 1 + \\Rf(yi) - Rf{y 2 )\\ ■ 

Solving this inequality we get 

\\Rf(vi) - -R/MII < lli/i - sail • 
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Lemma 26.4 The space GL(E, F) of continuous linear isomorphisms is an open 
subset of the Banach space L(E,F). In particular, if || id — A|| < 1 for some 
A £ GL(E) then A~ x = lini/v^oo ]C^ =0 (id -A) n . 

Proof. Let Aq £ GL(E, F). The map A i— > Ag 1 o A is continuous and maps 
GL(E, F) onto GL(E, F). If follows that we may assume that E = F and that 
Aq = id£. Our task is to show that elements of L(E, E) close enough to idE are 
in fact elements of GL(E). For this we show that 

Hid - A|| <1 

implies that A £ GL(E). We use the fact that the norm on L(E, E) is an algebra 
norm. Thus o A 2 || < ||^4.! || ||A 2 || for all A\,A 2 £ L(E, E). We abbreviate id 
by “1” and denote id — A by A. Let A 2 := A o A , A 3 := A o A o A and so forth. 
We now form a Neumann series : 

7T 0 = 1 
7Tl = 1 + A 

7T 2 = 1 + A + A 2 


TT n = 1 + A + A 2 + • • • + A”. 


By comparison with the Neumann series of real numbers formed in the same 
way using ||A|| instead of A we see that {tt„} is a Cauchy sequence since ||A|| = 
1 1 id — A 1 1 < 1. Thus {7r„} is convergent to some element p. Now we have 
(1 — A)7r„ = 1 — A™ +1 and letting n — > oo we see that (1 — A )p = 1 or in other 
words, Ap = 1. ■ 


Lemma 26.5 The map inv : GL(E, F) — > GL(E, F) given by taking inverses is a 
C°°map and the derivative of inv :g i— > g -1 at. some go £ GL(E, F) is the linear 
map given by the formida: D\mr\ gn :A>—> —gf 1 Agf 1 . 

Proof. Suppose that we can show that the result is true for go = id. Then 
pick any ho £ GL(E, F) and consider the isomorphisms Lh 0 : GL(E) — > GL(E, F) 
and R h - 1 : GL(E) — » GL(E, F) given by 0 i — > ho<j> and <f> i— > respectively. 

The map g i— > g~ l can be decomposed as 


invE / , - 1 \ 

1 9 ^ {h 0 og) 




, -1 
h Q 


1 l 


i o^o 1 — 9 


-l 


Now suppose that we have the result at go = id in GL(E). This means that 
DinvE | '-A — > —A. Now by the chain rule we have 


( Dinv lh 0 )' A = D ( R h^ oinv E°L h -i) A 

= ( R h - 1 ° Din ^L ° L h ~ l ) ' A 
= R h^ ° A ) ° = Ah , 0 
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so the result is true for an arbitrary ho £ GL(E, F). Thus we are reduced to 
showing that Z?invE| id :At—> —A. The definition of derivative leads us to check 
that the following limit is zero. 

1 1 (id +A) — 1 — (id) -1 — (—A) || 

hm n t n ■ 

II A||— >0 II A|| 


Note that for small enough ||A||, the inverse (id+A) -1 exists and so the above 
limit makes sense. By our previous result (26.4) the above difference quotient 
becomes 


(id+A) -1 - id+A 

hm m t n 

l|A||->o || A || 

||E” 0 (id-(id+A))"-id+A|| 

HA®o || A|| 

lim IISn=o(-A)"-id+A|| 

IIAIHO || A|| 


= lim 

l|A||-0 


IE” 2 (~A) r 

l|A|| 


< lim 

l|A||->0 


E” =2 IIA|| t 
II A|| 


= lim 

||A||->0 


Eii A ir 


lim JAL 
II All— o 1 — || A|| 


= 0. 


Theorem 26.9 (Inverse Mapping Theorem) Let E and F be Banach spaces 
and f : U — » F be a C r mapping defined an open set U C E. Suppose that x 0 £ U 
and that /'(x o) = Df \ : E — > F is a continuous linear isomorphism. Then 

there exists an open set V C U with Xq £ V such that f : V — > f(V) C F 
is a C r — diffeomorphism. Furthermore the derivative of f -1 at y is given by 

Df-'\ y = W\, 

Proof. By considering (Df l^) -1 of and by composing with translations we 
may as well just assume from the start that / : E — > E with Xo = 0, /( 0) = 0 
and Df | 0 = id#- Now if we let g = x — /( x), then Dg | 0 = 0 and so if r > 0 is 
small enough then 

\\Dg \ oil < | 

for x £ B( 0, 2r). The mean value theorem now tells us that || < 7 ( 0 : 2 ) — 5 ( 2 : 1 ) || < 
| ||x 2 — a?i || for X 2 , x\ £ B(0,r) and that g(B(0,r)) C B(0,r/2). Let yo € 
B(0, r/2). It is not hard to show that the map c : x 1 — > yo + x — f(x) is a 
contraction mapping c : f?(0,r) — > B( 0,?’) with constant ^ . The contraction 
mapping principle 26.6 says that c has a unique fixed point xq G 13(0, r). But 
c(x 0 ) = xo just translates to yo + Xo — f(x 0 ) = £0 and then /( xq) = yo- So xo 
is the unique element of B(0,r) satisfying this equation. But then since yo £ 
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B(0,r/2) was an arbitrary element of B(0 1 r/2) it follows that the restriction 
/ : B(0,r/2) — > f(B(0,r/2)) is invertible. But / -1 is also continuous since 

||/ _1 M -/ _1 (yi)|| = \\xi~xi\\ 

< II f(x 2 ) - f(x i)|| + \\g(x 2 ) - g(xi)\\ 

< \\f( x 2) — /(*i)|| + ^ 11*2 — *l|| 

= 1 1 1/2 - J/ill + | j j / 1 (1/2) - ,T 1 (z/i)|| 

Thus || / — 1 ( 3 / 2 ) — / _1 (y i)|| < 2 H 2/2 — j/i || and so / -1 is continuous. In fact, f~ x 
is also differentiable on S(0,r/2). To see this let f(x 2 ) = y 2 and f(x 1 ) = yi 
with x 2 ,xi £ B(0,r) and y 2 , yi £ B(0,r/2). The norm of Df(x 1)) -1 is bounded 
(by continuity) on B(0,r) by some number B. Setting x 2 — X\ = Ax and 
2/2 — 2/1 = Ay and using (Df(xf))~ 1 Df(x\) = id we have 

|jr 1 ( 2 / 2 ) - f~ 1 (yi) - (Df(x i)) _1 • Ay 1 1 
= || Ax - (D/(xi)) _ 1 (/(x 2 ) - /(xi))|| 

= \\{(Df(x 1 ))~ 1 Df(x 1 )}Ax - {(Df(x 1 ))- 1 Df(x 1 )}{Df(x 1 ))- 1 (f(x 2 ) - f(x 1 )) 
< B \\D f{x\)Ax - (/(x 2 ) - f{x i))|| < o(Ax) = o(Ay) (by continuity). 

Thus ) exists and is equal to (Df(x 1)) -1 = (Df(f~ 1 (yi)))~ 1 . A simple 

argument using this last equation shows that Df^ 1 (yi) depends continuously 
on yi and so f~ 1 is C 1 . The fact that / -1 is actually C r follows from a simple 
induction argument that uses the fact that Df is C r ~ 1 together with lemma 
26.5. This last step is left to the reader. ■ 

Exercise 26.5 Complete the last part of the proof of theorem 

Corollary 26.2 Let U C E be an open set and 0 £ U. Suppose that /:£/—> F 
is differentiable with Df(p) : E — » F a (bounded) linear isomorphism for each 
p £ U. Then f is a local diffeomorphism. 

Theorem 26.10 (Implicit Function Theorem I) Let Ei,E 2 and F be Ba- 
nach spaces and U x V C Ei x E 2 open. Let f : U x V F be a C r mapping 
such that /(xo,yo) = 0. If -D 2 /(x 0 , yo ) : E 2 — > F is a continuous linear isomor- 
phism then there exists a (possibly smaller) open set Uq C U with x 0 € Uq and 
unique mapping g : Uq — > V with g(xffj = yo and such that 

/(x,s(x)) = 0 


for all x £ Uq. 

Proof. Follows from the following theorem. ■ 

Theorem 26.11 (Implicit Function Theorem II) Let Ei, E 2 and F be Ba- 
nach spaces and U x V C Ei x E 2 open. Let f : U xV ^ F be a C r mapping such 
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that f(x o,yo) = wo- // Z? 2 /(xo, yo) : E 2 — > F is a continuous linear isomorphism 
then there exists (possibly smaller) open sets Uq C U and Wo C F with Xo € Uq 
and wo £ Wo together with a unique mapping g : Uq x Wo — > P swc/i that 

/(x,ff(x,w)) = w 

for all x £ Uq- Here unique means that any other such function h defined on a 
neighborhood U( x ITq equal g on some neighborhood o/(xo,wo). 

Proof. Sketch: Let $:t/xk-+EixFbe defined by \E'(x, y) = (x, /(x, y)). 
Then _DT(xo,yo) has the operator matrix 

idgj 0 

L>i/(xo,y 0 ) D 2 f(x 0 , y 0 ) _ 

which shows that D\E'(xo,yo) is an isomorphism. Thus T has a unique local 
inverse which we may take to be defined on a product set Uo x Wo- Now 
T -1 must have the form (x, y) 1 — > (x, <?(x, y)) which means that (x, /(x, g(x, w))) = 
\l/(x, g(x, w)) = (x, w). Thus f(x,g(x, w)) = w. The fact that g is unique follows 
from the local uniqueness of the inverse T -1 and is left as an exercise. ■ 

Let U be an open subset of a Banach space V and let I C ffi. be an open 
interval containing 0. A (local) time dependent vector field on U is a C r -m&p 
F : I x U — > V (where r > 0). An integral curve of F with initial value Xo is a 
map c defined on an open subinterval J C I also containing 0 such that 

c'(t) = F(t,c(t)) 
c(0) = xo 

A local flow for A is a map a : Iq x Uq — > V such that Uq C U and such that 
the curve a x (t) = a(t,x ) is an integral curve of F with a x ( 0) = x 

In the case of a map / : U —> V between open subsets of Euclidean spaces ( 
say K™ and R m ) we have the notion of rank at p £ U which is just the rank of 
the linear map D p f : R" — > R m . 

Definition 26.41 Let X, Y be topological spaces. When we write / :: X -4 Y 
we imply only that f is defined on some open set in X. If we wish to indicate 
that f is defined near p £ X and that /( p) = q we will used the pointed category 
notation together with the symbol % 

f :: (X, p) — > (Y, q) 

We will refer to such maps as local maps at p. Local maps may be com- 
posed with the understanding that the domain of the composite map may become 
smaller: Iff :: (X, p) -> (Y, q) and g :: (Y , q) -> (G, z) then gof :: (X. p) -> (G, z) 
and the domain of g o / will be a non-empty open set. 

Recall that for a linear map A : R n — » R m which is injective with rank r 
there exist linear isomorphisms C± : — > R m and C 2 : R" — > R” such that 

Ciodo Cf l is just a projection followed by an injection: 

= R m 


n—r 


m—r 
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We have obvious special cases when r = n or r = m. This fact has a local 
version that applies to C’ oc 'nonlinear maps. 

26.9.1 Linear case. 

Definition 26.42 We say that a continuous linear map A\ : Ei — » Fi is equiv- 
alent to a map A 2 : E 2 — » F 2 if there are continuous linear isomorphisms 
a : Ei — y E 2 and (3 : Fi — > F 2 such that A 2 = (3 o A\ o a -1 . 

Definition 26.43 Let A : E — > F be an injective continuous linear map. We 
say that A is a splitting injection if there are Banach spaces Fi and F 2 with 
F = Fi x F 2 and if A is equivalent to the injection hap : Fi — > Fi x F 2 . 

Lemma 26.6 If A : E — > F is a splitting injection as above then there exists a 
linear isomorphism 6 : F — > E x F 2 such that S o A : E — » E x F 2 is the injection 
x i— > (x, 0). 

Proof. By definition there are isomorphisms a : E — > Fi and (3 : F — » Fi x F 2 
such that (3 o A o a _1 is the injection Fi — > Fi x F 2 . Since a is an isomorphism 
we may compose as follows 

(aT 1 x id E ) o (3 o A o oT 1 o a 
= (a -1 x id E ) o (3 o A 
= So A 

to get a map which is can seen to have the correct form. 


(E,P) 

(F,q) 

a IT a 1 

IP 

(Ei, 0) 

(Fi x F 2i (0,0)) 

a 1 IT a 

| a -1 x id 

/r- 1 in Jl 

(E,p) 

(E x F 2) (p, 0)) 


If A is a splitting injection as above it easy to see that there are closed 
subspaces F E and F 2 of F such that F = Fi ® F 2 and such that A maps E iso- 
morphically onto F E . 

Definition 26.44 Let A : E — > F be an surjective continuous linear map. We 
say that A is a splitting surjection if there are Banach spaces Ei and E 2 with 
E = Ei x E 2 and if A is equivalent to the projection pr\ : Ei x E 2 — > Ei. 

Lemma 26.7 If Let A : E — > F is a splitting surjection then there is a linear 
isomorphism 6 : F x E 2 — > E such that Ao<5:FxE 2 — >Fis the projection 
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Proof. By definition there exist isomorphisms a : E — > Ei x E 2 and P : F — » 
Ei such that (3 o Ao a -1 is the projection pr\ : Ei x E 2 — > Ei. We form another 
map by composition by isomorphisms; 

/3 _1 o/Jodoa' 1 o()3, idE 2 ) 

= A o a -1 o (/?, id£ 2 ) := A o S 


and check that this does the job. Examine the diagram for guidance if you get 
lost: 


(Ei x E 2 , (0,0)) 
(/3, id E2 ) t 
(F x E 2 , (q, 0)) 


(F,q) 

r 1 u p 

(Ei , 0) 

PUP - 1 

(F,q) 


If A is a splitting surjection as above it easy to see that there are closed 
subspaces Ei and E 2 of E such that E = Ei ® E 2 and such that A maps E onto 
Ei as a projection x + y 1 — > x. 

26.9.2 Local (nonlinear) case. 

Definition 26.45 Let f\ : (Ei,p 1 )— > (F 1 ,q 1 ) be a local map. We say that /j 
is locally equivalent near pi to / 2 : (E 2 , p 2 )— >(F 2 , q 2 ) if there exist local diffeo- 
morphisms a : Ei — > E 2 and P : Fi — > F 2 such that f\ = a o f 2 o /3 _1 (near p) or 
equivalently if f 2 = P 0 fi 0 c® 1 (near p 2 ). 

Definition 26.46 Let f :: E, p — > F, q be a local map. We say that f is a lo- 
cally splitting injection or local immersion if there are Banach spaces Fi 
and F 2 with F = Fi x F 2 and if f is locally equivalent near p to the injection 
inji :: (Fi,0) -> (Fi x F 2 ,0). 


By restricting the maps to possibly smaller open sets we can arrange that 
(a — ^ f /3 

P o f o a -1 is given by U' — > U V d-* U' x V' which we will call a nice local 
injection. 

Lemma 26.8 If f is a locally splitting injection as above there is an open set 
U\ containing p and local diffeomorphism Lp : U\ C F — * t/ 2 C E x F 2 and such 
that p o /(x) = (x, 0) for all x G Ui. 

Proof. This is done using the same idea as in the proof of lemma 26.6. ■ 

Definition 26.47 Let f :: (E, p) — » (F,q) be a local map. We say that f is a 
locally splitting surjection or local submersion if there are Banach spaces 
Ei and E 2 with E = Ei x E 2 and if f is locally equivalent (at p) to the projection 

W\ ■■ Ei x E 2 — > Ei. 
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Again, by restriction of the domains to smaller open sets we can arrange 

that projection 0 o f o aT 1 is given by U' x V' U T V U' which we will 
call a nice local projection. 

Lemma 26.9 If f is a locally splitting surjection as above there are open sets 
U\ x U 2 C F x E 2 and V C F together with a local diffeomorphism ip : U\ x U 2 C 
F x E 2 — > V C E such that f o tp( u, v) = u for all (u, v) £ U\ x Ui. 

Proof. This is the local (nonlinear) version of lemma 26.7 and is proved 
just as easily. ■ 

Theorem 26.12 (local immersion) Let f :: (E, p) — > (F, q) be a local map. If 
Df | : (E, p) — > (F, q) is a splitting injection then f :: (E, p) — » (F,q) is a local 
immersion. 

Theorem 26.13 (local immersion- finite dimensional case) Let.f :: R" - 
R" xR fc be a map of constant rank n in some neighborhood of 0 S R". Then there 
is gi :: R" — > R™ with g i(0) = 0, and a gi :: R ra — > R ra xR fc such that c/ 2 0 f 0 gf 1 
is just given by x 1 — > (x, 0) £ R”xR fc . 

We have a similar but complementary theorem which we state in a slightly 
more informal manner. 

Theorem 26.14 (local submersion) Let f :: (E,p) — » (F.q) be a local map. 
If Df l p : (E, p) — * (F, q) is a splitting surjection then f :: (E, p) — > (F.q) is a 
local submersion. 

Theorem 26.15 (local submersion -finite dimensional case) Let f :: (R n 

(R",0) be a local map with constant rank n near 0. Then there are dijfeomor- 
phisms gi :: (R"xR fc ,0) — > (R™xR fe ,0) and g 2 :: (R",0) — + (R",0) such that 
near 0 the map 

92 0 f 0 gf 1 (R ra x R fc , 0) —> (R" , 0) 

is just the projection (x,y) 1 — > x. 

If the reader thinks about what is meant by local immersion and local sub- 
mersion he/she will realize that in each case the derivative map Df p has full 
rank. That is, the rank of the Jacobian matrix in either case is a big as the 
dimensions of the spaces involved allow. Now rank is only a semicontinuous and 
this is what makes full rank extend from points out onto neighborhoods so to 
speak. On the other hand, we can get more general maps into the picture if we 
explicitly assume that the rank is locally constant. 

26.10 The Tangent Bundle of an Open Subset 
of a Banach Space 

Later on we will define the notion of a tangent space and tangent bundle for a 
differentiable manifold which locally looks like a Banach space. Here we give a 
definition that applies to the case of an open set U in a Banach space. 


xR fc , 0) 
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Definition 26.48 Let E be a Banach space and U C E an open subset. A 
tangent vector at x G U is a pair (x, v) where v G E. The tangent space at 
x G U is defined to be T X U := T x E := {x} x E and the tangent bundle TU over 
U is the union of the tangent spaces and so is just TU = U x E. Similarly the 
cotangent bundle over U is defined to be T*U = U x E*. A tangent space T x E 
is also sometimes called the fiber at x. 

We give this definition in anticipation of our study of the tangent space at a 
point of a differentiable manifold. In this case however, it is often not necessary 
to distinguish between T X U and E since we can often tell from context that an 
element v G E is to be interpreted as based at some point x € U. For instance a 
vector field in this setting is just a map X : U — » E but X(x) should be thought 
of as based at x. 

Definition 26.49 If f : U — > F is a C r map into a Banach space F then the 
tangent map T f : TU — » TF is defined by 

Tf ■ (x, v) = (/ (x) , Df (x) • v) . 

The map takes the tangent space T X U = T X E linearly into the tangent space 
T /(x) F for each x G U. The projection onto the first factor is written tjj : TU = 
U x E — > U and given by rj/(x, v) = x. We also have a projection ttjj : T*U = 
U x E*— >U defined similarly. 

If / : U — > V is a diffeomorphism of open sets U and V in E and F respectively 
then Tf is a diffeomorphism that is linear on the fibers and such that we have 
a commutative diagram: 

TU = U xE V x F =TV 
Pn | I pri 

U — > V 

f 

The pair is an example of what is called a local bundle map. In this context 
we will denote the projection map TU = U x E — * U by tjj- 
The chain rule looks much better if we use the tangent map: 

Theorem 26.16 Let U\ and U 2 be open subsets of Banach spaces Ei and E 2 
respectively. Suppose we have differentiable (resp. C r ,r > 1) maps composing 
as 

U x 4 U 2 ^ E 3 

where E 3 is a third Banach space. Then the composition is go f differentiable 
(resp. C r , r > 1) and T(g o f) = Tg o Tf 

TUi TU 2 ^ TE 3 

n/i I tu 2 I l te 3 

U x 4 U 2 4 E 3 
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Notation 26.4 (and convention) There are three ways to express the “differ- 
ential/derivative” of a differentiable map f : U C E — > F. 

1. The first is just Df : E — > F or more precisely Df | x : E — > F for any point 
x gU. 

2. This one is new for us. It is common but not completely standard : 

dF : TU — + F 

This is just the map (x, v) — » Df | x v. We will use this notation also in 
the setting of maps from manifolds into vector spaces where there is a 
canonical trivialization of the tangent bundle of the target manifold (all 
of these terms will be defined). The most overused symbol for various 
“differentials” is d. We will use this in connection with Lie group also. 

3. Lastly the tangent map Tf : TU — > T F which we defined above. This is 
the one that generalizes to manifolds without problems. 

In the local setting that we are studying now these three all contain es- 
sentially the same information so the choice to use one over the other is 
merely aesthetic. 

It should be noted that some authors use df to mean any of the above 
maps and their counterparts in the general manifold setting. This leads to less 
confusion than one might think since one always has context on one’s side. 

26.11 Problem Set 

Solution set 1 

1. Find the matrix that represents the derivative the map / : R" — > R m 
given by 

a) f(x) = Ax for an m x n matrix A. 

b) f(x) = x t Ax for an n x n matrix A (here m = 1) 

c) f(x) = x l x 2 ■ ■ ■ x n (here to. = 1) 

2. Find the derivative of the map F : L 2 {[ 0, 1]) — > L 2 ([ 0, 1]) given by 

F[f] (x)=[ k(x, y)f{y)dy 
Jo 

where k(x,y) is a bounded continuous function on [0, 1] x [0, 1]. Hint: F 
is linear! 

3. Let L : R™ x R n x R — > R be C°° and define 

S[c] = f L(c(t),c'(t),t)dt 
Jo 
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Tangent 

map 


Figure 26.1: Versions of the “derivative” map. 
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which is defined on the Banach space B of all C 1 curves c : [0, 1] — » R n with 
c(0) = 0 and c(l) = 0 and with the norm ||c|| = sup te r 01 i{|c(t)| + |c'(f)|}. 
Find a function g c : [0, 1] — > R n such that 

DS\ c -b= [ (g c (t),b(t))dt 
Jo 

4. In the last problem, if we hadn’t insisted that c(0) = 0 and c(l) = 0, but 
rather that c(0) = Xq and c(l) = X\, then the space wouldn’t even have 
been a vector space let alone a Banach space. But this fixed endpoint 
family of curves is exactly what is usually considered for functionals of 
this type. Anyway, convince yourself that this is not a serious problem by 
using the notion of an affine space (like a vector space but no origin and 
only differences are defined, (look it up)). Is the tangent space of the this 
space of fixed endpoint curves a Banach space? (hint: yep!) 

Hint: If we choose a fixed curve Cq which is the point in the Banach space 
at which we wish to take the derivative then we can write B% 0 $ 1 = B + cq 
where 


Bs 0 X! = {c : c(0) = Xo and c(l) = a?i} 

B = {c: c(0) = 0 and c(l) = 0} 

Then we have T Cq Bs oXi — B. Thus we should consider DS \ Cq : B — > B. 

5. Let Gl(n , R) be the nonsingular n x n matrices and show that Gl(n, R) is 
an open subset of the vector space of all matrices M nxn (R) and then find 
the derivative of the determinant map: det : G7(n, R) — > R (for each A 
this should end up being a linear map Hdetl^ : M nxn ( R) — > R). 

What is det X where X = ( Xij ) ? 

6. Let A : U CE— > L(F,F) be a C r map and define F : U x F — > F by 
F(u,f) := A{u)f. Show that F is also C r . 

Hint: Leibnitz rule theorem. 

26.11.1 Existence and uniqueness for differential equations 

Theorem 26.17 Let E be a Banach space and let X : U C E — » E be a 
smooth map. Given any Xq G U there is a smooth curve c : (— e, e) — > U with 
c(0) = Xq such that c'{t) = X(c(t)) for all t G (— e, e). If C\ : (— ei,ei) — > V 
is another such curve with ci(0) = Xo and c^ft) = X(c(t)) for all t G (— ei,ei) 
then c = Ci on the intersection (— ei,ei) f~l (— e, e). Furthermore, there is an 
open set V with Xq G V C U and a smooth map $ : E x (—a, a) — * U such that 
1 1 — > c x (t) := ^(atf) is a curve satisfying d (t) = X(c(t)) for all t G (—a, a). 
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26.11.2 Differential equations depending on a parameter. 

Theorem 26.18 Let J be an open interval on the real line containing 0 and 
suppose that for some Banach spaces E and F we have a smooth map F : JxU x 
V — > F where U C E and V C F. Given any fixed point (xo,j/o) G U x V there 
exist a subinterval Jq C J containing 0 and open balls B\ C U and ftCf with 
(xo,2/o) G B\ x B 2 and a unique smooth map 

/3 Jo x B\ x .62 — * V 


such that 

1) = F(t,x, p(t,x,y)) for all ( t,x,y ) € Jo x £>i x i ?2 and 

2) /3(0,x,y) = y. 

Furthermore, 

3) if we let f3(t , x) := f3(t, x, y) for fixed y then 

^-D 2 (3(t, x)-v = D 2 F{t , x, /?(f , x)) • x 
dt 

+ D 3 F(t, x, P(t, x)) • D 2 p(t , x) • x 


for all v € E. 

26.12 Multilinear Algebra 

There a just a few things from multilinear algebra that are most important for 
differential geometry. Multilinear spaces and operations may be defined starting 
with in the category vector spaces and linear maps but we are also interested in 
vector bundles and their sections. At this point we may consider the algebraic 
structure possessed by the sections of the bundle. What we have then is not 
(only) a vector space but (also) a module over the ring of smooth functions. 
The algebraic operation we perform on this level are very similar to vector space 
calculations but instead of the scalars being the real (or complex) numbers the 
scalars are functions. So even though we could just start defining things for 
vector spaces it will be more efficient to consider modules over a commutative 
ring R since vector spaces are also modules whose scalar ring just happens to be 
a field. We remind the reader that unlike algebraic fields like R or C there may 
be elements a and b in a ring such that ab = 0 and yet neither a nor b is zero. 
The ring we mostly interested in is the ring of smooth functions on a manifold. 

The set of all vector fields X(M) on a manifold is a vector space over the real 
numbers. But £(M) has more structure since not only can we add vector fields 
and scale by numbers but we may also scale by smooth functions. We say that 
the vector fields form a module over the ring of smooth functions. A module is 
similar to a vector space with the differences stemming from the use of elements 
of a ring R of the scalars rather than a field (such as the complex C or real 
numbers R). For a module, one still has lrc = w, Ore = 0 and —Ire = —re. Of 
course, every vector space is also a module since the latter is a generalization of 
the notion of vector space. 
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Definition 26.50 Let R be a ring. A left R-module (or a left module over 

R) is an abelian group W,+ together with an operation R x W — > W written 

(a, w) i— > aw such that 

1) (a + b)w = aw + bw for all a, b gR and all w £ W, 

2) a(w i + W 2 ) = aw 1 + aW 2 for all a GR and all W 2 , w 1 £ W. 

A right R-module is defined similarly with the multiplication of the right so 
that 

1 ) w{a + b) = wa + wb for all a,b gR and all w £ W, 

2) (wi + W 2 )a = Wi a + W 2 a for all a GR and all W 2 , w 1 £ W. . 

If the ring is commutative (the usual case for us) then we may write aw = wa 
and consider any right module as a left module and visa versa. Even if the ring 
is not commutative we will usually stick to left modules and so we drop the 
reference to “left” and refer to such as R- modules. 

Remark 26.10 We shall often refer to the elements of R as scalars. 

Example 26.10 An abelian group A, + is a Z module and a Z-module is none 
other than an abelian group. Here we take the product of n G Z with x G A to 
be nx := x + ■■■+ x if n > 0 and nx := — (a: + • • • + x) if n < 0 (in either case 
we are adding |n| terms). 

Example 26.11 The set of all m x n matrices with entries being elements of 
a commutative ring R (for example real polynomials) is an R-module. 

Example 26.12 The set of all module homomorphisms of a module W onto an- 
other module M is a module and is denoted Homc>oo(M)(W, M) or £c°°(m)(W,M) 
In particular, we have HoniR(r(£i),r(£ 2 )) where £1 and £1 are vector bundles 
over a smooth manifold M. 

Example 26.13 Let V be a vector space and t : V — > V a linear operator. Using 
this one operator we may consider V as a module over the ring of polynomials 
R[t] by defining the “ scalar ” multiplication by 

p(t)v := p(t)v 

for p £ R[f] . For example, (1 + t 2 )v = v + I 2 v. 

Since the ring is usually fixed we often omit mention of the ring. In particu- 
lar, we often abbreviate Lr(W, M) to L(W, M). Similar omissions will be made 
without further mention. Also, since every real (resp. complex) Banach space E 
is a vector space and hence a module over R (resp. C) we must distinguish be- 
tween the bounded linear maps which we have denoted up until now as L( E; F) 
and the linear maps which would be denoted the same way in the context of 
modules. Our convention will be the following: 

In case the modules in question are presented as infinite dimensional topo- 
logical vector spaces, say E and F, we will let L( E; F) continue to mean the space 
of bounded linear operator unless otherwise stated. 
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A submodule is defined in the obvious way as a subset S C W which 
is closed under the operations inherited from W so that S itself is a module. 
The intersection of all submodules containing a subset A C W is called the 
submodule generated by A and is denoted (A) and A is called a generating 
set. If (A) = W for a finite set A , then we say that W is finitely generated. 

Let S' be a submodule of W and consider the quotient abelian group W/S 
consisting of cosets, that is sets of the form [u] := v + S = {v + x : x € S} 
with addition given by [w] + [to] = [v + w\ . We define a scalar multiplication by 
elements of the ring R by o[u] := [az>] respectively. In this way, W/S is a module 
called a quotient module. 

Definition 26.51 Let Wi and W -2 be modules over a ring R. A map L : W\ — * 
Wi is called module homomorphism if 

L(awi + bw 2) = aL(wi) + bL(w 2). 

By analogy with the case of vector spaces, which module theory includes, we 
often characterize a module homomorphism L by saying that L is linear over 
R. 


A real (resp. complex) vector space is none other than a module over the 
field of real numbers R. (resp. complex numbers C). In fact, most of the modules 
we encounter will be either vector spaces or spaces of sections of some vector 
bundle. 

Many of the operations that exist for vector spaces have analogues in the 
module category. For example, the direct sum of modules is defined in the 
obvious way. Also, for any module homomorphism L : Wi — > W 2 we have the 
usual notions of kernel and image: 

kerl, = {v € Wi : L(v) = 0} 

img(L) = L(Wi) = {u> € W 2 : w = Lv for some v £ Wi}. 

These are submodules of Wi and W 2 respectively. 

On the other hand, modules are generally not as simple to study as vector 
spaces. For example, there are several notions of dimension. The following 
notions for a vector space all lead to the same notion of dimension. For a 
completely general module these are all potentially different notions: 

1. Length of the longest chain of submodules 


0 = W n C • • • C Wi C W 


2. The cardinality of the largest linearly independent set (see below). 

3. The cardinality of a basis (see below) . 

For simplicity in our study of dimension, let us now assume that R is com- 
mutative. 



26.12. MULTILINEAR ALGEBRA 


421 


Definition 26.52 A set of elements W\,...,Wk of a module are said to be lin- 
early dependent if there exist ring elements r\,...,rk £ R not all zero, such 
that r\W\ + ■ ■ ■ + rkWk = 0. Otherwise, they are said to be linearly indepen- 
dent. We also speak of the set {u>i, Wk} as being a linearly independent 
set. 

So far so good but it is important to realize that just because w i,...,Wk 
are linearly independent doesn’t mean that we may write each of these w, as a 
linear combination of the others. It may even be that some element w forms a 
linearly dependent set since there may be a nonzero r such that rw = 0 (such a 
w is said to have torsion). 

If a linearly independent set {w \, ..., Wk} is maximal in size then we say that 
the module has rank k. Another strange possibility is that a maximal linearly 
independent set may not be a generating set for the module and hence may not 
be a basis in the sense to be defined below. The point is that although for an 
arbitrary w £ W we must have that {wi, u>fc}U{u>} is linearly dependent and 
hence there must be a nontrivial expression rw + r\W i + • • • + r^Wk = 0, it does 
not follow that we may solve for w since r may not be an invertible element of 
the ring (i.e. it may not be a unit). 

Definition 26.53 If B is a generating set for a module W such that every ele- 
ment of W has a unique expression as a finite R-linear combination of elements 
of B then we say that B is a basis for W. 

Definition 26.54 If an R-module has a basis then it is referred to as a free 
module. 

If a module over a (commutative) ring R has a basis then the number of 
elements in the basis is called the dimension and must in this case be the same 
as the rank (the size of a maximal linearly independent set). If a module W is 
free with basis w i, ..., w n then we have an isomorphism R" = W given by 

(ri, ..., r n ) nw i H 1- r n w n . 

Exercise 26.6 Show that every finitely generated R-module is the homomorphic 
image of a free module. 

It turns out that just as for vector spaces the cardinality of a basis for a free 
module W over a commutative ring is the same as that of every other basis for 
W. The cardinality of any basis for a free module W is called the dimension of 
W. If R is a field then every module is free and is a vector space by definition. 
In this case, the current definitions of dimension and basis coincide with the 
usual ones. 

The ring R is itself a free R-module with standard basis given by {1}. Also, 
R n :=R x • • • x R is a free module with standard basis {ei, ...., e„} where, as usual 
e-i := (0, ..., 1, ...,0); the only nonzero entry being in the i-tli position. 
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Definition 26.55 Let k be a commutative ring, for example a field such as R 
or C. A ring A is called a k-algebra if there is a ring homomorphism p : k — >/? 
such that the image p(k) consists of elements which commute with everything 
in A. In particular, A is a module over k. 

Example 26.14 The ring Cf^(U) is an R- algebra . 

We shall have occasion to consider A-modules where A is an algebra over 
some k. In this context the elements of A are still called scalars but the elements 
of k CA will be referred to as constants. 

Example 26.15 For an open set U C M the set vector fields Xm(U) is a vector 
space over R but it is also a module over the R -algebra Cf^(U) . So for all X,Y £ 
Xm{U) and all f,g £ C^(If) we have 

1. f(X + Y) = fX + fY 

2. (f + g)X = fX + gX 

3. f(gX) = ( fg)X 

Similarly, X* M (U) = T(U,T* M) is also a module over Cf^{U) which is nat- 
urally identified with the module dual Xm(U)* by the pairing ( 9,X ) i— > 6(X). 
Here 0(X) £ C^(U) and is defined by pn 0 p (X p ). The set of all vector fields 
Z C X(U) which are zero at a fixed point p £ U is a submodule in X(U). If 
U, (a: 1 , ... ,x n ) is a coordinate chart then the set of vector fields 

d d 
dx 1 ’ ’ dx n 

is a basis (over Cf^{U)) for the module X(U). Similarly, 

dx 1 , ..., dx n 

is a basis for X* M (JJ). It is important to realize that if U is not a coordinate chart 
domain then it may be that %m(U) and Xm{U)* have no basis. In particular, 
we should not expect X(M) to have a basis in the general case. 

Example 26.16 The sections of any vector bundle over a manifold M form a 
C°° (M ) -module denoted T(E). Let E — > M be a trivial vector bundle of finite 
rank n. Then there exists a basis of vector fields E ±, ..., E n for the module T(E). 
Thus for any section X there exist unique functions /* such that 

X = Y J fEi- 


In fact, since E is trivial we may as well assume that E = M x R" M . Then 
for any basis e\,...,e n for R™ we may take 


Ei{x) := (x,ei). 
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In this section we intend all modules to be treated strictly as modules. Thus 
we do not require multilinear maps to be bounded. In particular, Lr(Vi, . . . , V fc ; W) 
does not refer to bounded multilinear maps even if the modules are coinciden- 
tally Banach spaces. We shall comment on how thing look in the Banach space 
category in a later section. 

Definition 26.56 The dual of an R— module W is the module W* := HoniR(W,R) 
of all R -linear functionals on W. 

Definition 26.57 If the map () : W — > W** := HoniR(W*,R) given by 

w(a) := a(w) 

is an isomorphism then we say that W is reflexive . If W is reflexive then 
we are free to identify W with W** and consider any w £ W as map w = w : 
W* — > R. 

Exercise 26.7 Show that if W is a free with finite dimension then W is reflex- 
ive. We sometimes write w(a ) = (w,a) = ( a,w ). 

There is a bit of uncertainty about how to use the word “tensor”. On the 
one hand, a tensor is a certain kind of multilinear mapping. On the other hand, 
a tensor is an element of a tensor product (defined below) of several copies of a 
module and its dual. For finite dimensional vector spaces these two viewpoints 
turn out to be equivalent as we shall see but since we are also interested in infinite 
dimensional spaces we must make a terminological distinction. We make the 
following slightly nonstandard definition: 

Definition 26.58 Let V and W be R -modides. A W -valued ( r ) -tensor map 
on\ is a multilinear mapping of the form 


A:V*xV*---xV*xVxVx---xV— > W. 

X ^ X X ^ X 

r— times s— times 

The set of all tensor maps into W will be denoted T” s (V; W). Similarly, a 
W-valued ( r )-tensor map on V is a multilinear mapping of the form 

A:VxVx---xVxV*xV*---xV*— >W 

V v ✓ V v ✓ 

s— times r— times 

and the corresponding space of all such is denoted T s r (V; W). 

There is, of course, a natural isomorphism T r (V; W) =T r (V; W) induced 
by the map V s x y* r ^y*r x V s given on homogeneous elements by v (g> to i— > 
u> ® v. (Warning) In the presence of an inner product there is another possible 
isomorphism here given by v® u> i— > \>v ® j)u> This map is a “transpose” map 
and just as we do not identify a matrix with its transpose we do not 
generally identify individual elements under this isomorphism. 
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Notation 26.5 For the most part, we shall be needing only T r (V; W) and so 
we agree abbreviate this to TJ(V; W) and call the elements (r, s) -tensor maps. 
So by convention 

T;(V;W) := T r s (V; W) 
but 

T;(V;W)^T/(V;W). 

Elements of Tq(V) are said to be of contravariant type and of degree r 
and those in T°(V) are of covariant type (and degree s). If r, s > 0 then the 
elements of TJ(V) are called mixed tensors (of tensors of mixed type) 
with contravariant degree r and covariant degree s. 

Remark 26.11 An R -valued (r, s)-tensor map is usually just called an (r, s)- 
tensor but as we mentioned above, there is another common meaning for this 
term which is equivalent in the case of finite dimensional vector spaces. The 
word tensor is also used to mean “tensor field” (defined below). The context 
will determine the proper meaning. 

Remark 26.12 The space Tf (V: R) is sometimes denoted by TJ(V; R) (or even 
TJ (V) in case R but we reserve this notation for another space defined 
below which is canonically isomorphic to TJ(V; R) in case V is free with finite 
dimension. 

Definition 26.59 Given Ti £ TJ(V; R) and Y 2 G TP(V; R) we may define the 
(consolidated) tensor product Ti (g> Y 2 £ Tf^(Y; R) by 

(Ti ® T 2 )(«i, ...,a r ,v 1 , ...,v a ) 

:= T 1 (a 1 , a r )T 2 (i> 1 , ..., v s ) 

Remark 26.13 We call this type of tensor product the “map” tensor prod- 
uct in case we need to distinguish it from the tensor product defined below. 

Now suppose that V is free with finite dimension n. Then there is a basis 
/ 1 , f n for V with dual basis f 1 , ..., f n . Now we have V* — Tf (V; R). Also, 
we may consider /) £ V** = (V*; R) and then, as above, take tensor products 

to get elements of the form 

fn ® • • • <8 fi r ® f n ® ® f 3s ■ 

These are multilinear maps in TJ(V; R) by definition: 

(fn f Js )(a 1 , ...,a r ,vi, ...,v s ) 

= a 1 (fi 1 ) ■ ■ ■ a r (f ir )f n (vi) • • • f 3s (v s ). 

There are n s n r such maps which form a basis for TJ(V; R) which is therefore 
also free. Thus we may write any tensor map T £ Tf (V: R) as a sum 


X X T ' : '-' V h,-3s fh ® ® hr ® P 1 ® ‘ ‘ ® /' 
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and the scalars T n '" v j 1 ,...j B € R 

We shall be particularly interested in the case where all the modules are 
real (or complex) vector spaces such as the tangent space at a point on a 
smooth manifold. As we mention above, we will define a space TJ(V;R) for 
each (r, s) which is canonically isomorphic to XJ (V; R) . There will be a product 
(g>TJ(V; R) xTP(V;R) — ■> Tf+£(V; R) for these spaces also and this will match up 
with the current definition under the canonical isomorphism. 

Example 26.17 The inner product (or “dot product”) on the Euclidean vector 
space R" given for vectors v = (tq, ..., v n ) and w = (w i, ..., w n ) by 

n 

( v , w) ^ ( V , w) = ^2 V i W i 

i= 1 

is 2-midtilinear (more commonly called bilinear). 

Example 26.18 For any n vectors vi, ...,v„ eR" the determinant of (v i,...,v n ) 
is defined by considering vy, ..., v n as columns and taking the determinant of the 
resulting n x n matrix. The resulting function is denoted det(vi, ..., v„) and is 
n- multilinear. 

Example 26.19 Let 3C(M) be the C°° (M)-module of vector fields on a manifold 
M and let X*(M) be the C°° (M)-module of 1 -forms on M. The map X*(M) x 
X{M) — > C°°(M) given by (a, X) i— > a(X) € C°°(M) is clearly multilinear 
(bilinear) over C°°(M). 

Suppose now that we have two R- modules Vi and V 2 . Let us construct 
a category Cy lX v 2 whose objects are bilinear maps Vi x V 2 — > W where W 
varies over all R-modules but Vi and V 2 are fixed. A morphism from, say 
pi : Vi x V 2 — > Wi to P 2 ■ Vi x V 2 — » W 2 is defined to be a map I : Wi — » W 2 
such that the diagram 

Wi 

Z 1 mi 

v, x v 2 n 

\ V2 

W 2 

commutes. Suppose that there is an R- module Tv 1 ,v 2 together with a bilinear 
map t : Vi x V 2 — > Tv 1j v 2 has the following universal property for this 
category: For every bilinear map ^ 1 : V 1 XV 2 - » W there is a unique linear map 
p : Ty 1 ,v 2 W such that the following diagram commutes: 

Vi x V 2 4 W 
U Sn 
T Vi ,v 2 


If such a pair Ty lj y 2 ,t exists with this property then it is unique up to isomor- 
phism in Cy lX y 2 . In other words, if t : Vi x V 2 — »■ Ty 1; y 2 is another object with 
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this universal property then there is a module isomorphism Ty, ,v 2 = Ty^y,, 
such that the following diagram commutes: 


T Vi,V 2 

/t 

Vi x V 2 

V 

Ty 1 ,v 2 

We refer to any such universal object as a tensor product of Vi and V 2 . We 
will construct a specific tensor product which we denote by Vi ® V 2 with the 
corresponding map denoted by® : Vi xV 2 — »Vi®V 2 . More generally, we seek 
a universal object for fc-multilinear maps p :V i x • • • xVt — > W. 

Definition 26.60 A module T = Ty 1; ... ) y fe together with a midtilinear map 
u : Vi x • • • x Vk — > T is called universal for k-multilinear maps on Vi x 
• • • x Vfc if for every midtilinear map p : V\ x • • • x Vfc — * W there is a unique 
linear map p : T — > W such that the following diagram commutes: 

Vi x • • • x V fc A W 
u I /p 

T 

i.e. we must have p = p o u. If such a universal object exists it will be called 
a tensor product of V\ ,...,Vk and the module itself T = Ty lj ... ; y fe is also 
referred to as a tensor product of the modules V\ , ...,Vfc. 

Lemma 26.10 If two modules Ti,ui andT 2 ,u 2 are both universal for k-multilinear 
maps on Vi x • • • x Vfc then there is an isomorphism 4> : Ti — > T 2 such that 
$ o m = u 2 ; 

Vi x • • • x Vfc 

ui ^ \ u 2 . 

Ti ^ T 2 

Proof. By the assumption of universality, there are maps ui and u 2 such 
that 4> oui = u 2 and 4> ou 2 = ui. We thus have o <f> oui = Ui = id and by the 
uniqueness part of the universality of ui we must have o 4> = id or 4> = 

■ 

We now show the existence of a tensor product. The specific tensor product 
of modules Vi, ...,Vfc that we construct will be denoted by Vi ® ■ ■ ■ ® Vfc and 
the corresponding map will be denoted by 

® fc : Vi x • • • x Vfc -► Vi ® • • • ® Vfc. 

We start out by introducing the notion of a free module on an arbitrary set. 

If S is just some set, then we may consider the set Fr(S) all finite formal linear 
combinations of elements of S with coefficients from R. For example, if a, b, c G R 
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and Si, S21 S3 £ S then asi + bs 2 + CS3 is such a formal linear combination. In 
general, an element of Fr(S') will be of the form 

y ^a s s 

sGS 

where the coefficients a s are elements of R and all but finitely many are 0. 
Thus the sums involved are always finite. Addition of two such expressions and 
multiplication by elements of R are defined in the obvious way; 

b a 3 s = ba s s 
sGS s£S 

Em + E b s s = y ~2(a s + b s )s. 
s£S sGS" s£.S 

This is all just a way of speaking of functions a() : S — >R with finite support. 
It is also just a means of forcing the element of our arbitrary set to be the “basis 
elements” of a modules. The resulting module Fr(S) is called the free module 
generated by S. For example, the set of all formal linear combinations of the 
set of symbols {i, j} over the real number ring, is just a 2 dimensional vector 
space with basis {i,j}. 

Let Vi , • • • ,Vfc be modules over R and let Fr(Vi x • • • x Vfc) denote the set of 
all formal linear combinations of elements of the Cartesian product Vi x • • • x V*,. 
For example 

3(vi,w) — 2(v2,w) £ F R (V, W) 

but it is not true that 3(ui,u>) — 2(v2,w) = 3(ui — 2v2,w) since (vi,w), 
(v 2 ,w) and (ui — 2v2,w) are linearly independent by definition. We now de- 
fine a submodule ofFR(Vi x ••• x Vfc). Consider the set B of all elements of 
Fr(Vi x ••• x V fc ) which have one of the following two forms: 


1 . 


(vi, ..., avi, ■ ■ ■ ,v fc ) - o(vi, ...,Vj, • • • ,v fc ) 


for some a £ R and some 1 < i < k and some (v l5 ..., v,, • • • , vj,) £ 
Vi x ••• x V fc . 


2 . 


(Vl, ..., Vj + Wj, - ■ ■ , Vfc) - (Vi, ...,Vj,- • • , Vfc) (Vl, ...,W i, ■■■ , Vfc) 

for some 1 < i < k and some choice of (v 1; ..., v*, • • • , v fc ) £ Vi x • • • x V fc 
and w i € Vi. 


We now define (B) to be the submodule generated by B and then define 
the tensor product Vi <g> • • • (g> Vfc of the spaces Vi , • • • , Vfc to be the quotient 
module F R (V! x ••• x V k )/(B). Let 


7 T : F r (V ! x • • • x Vfc) - F r (Vi x • • • x V k )/{B) 
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be the quotient map and define vi O ' • • ® Vfc to be the image of (vi, • • • , Vfc) € 
Vi x • • • x Vfc under this quotient map. The quotient is the tensor space we 
were looking for 


Vi O • • • O Vfc := F r (Vi x • • • x Vfc)/(B). 


To get our universal object we need to define the corresponding map. The map 
we need is just the composition 


Vi x • • • x Vfc F r (Vi x • • • x Vfc) ->• Vi (8 ) • • • ® Vfc. 

We denote this map by O fc : Vi x • • • x Vfc — » Vi O • • ‘ O Vfc . Notice that 

O fc (vi, • • • , Vfc) = vi O • • • O Vfc. By construction, we have the following facts: 

1. 

vi<g)---(g> avi O ■ ■ ■ O Vfc = avi O ■ ■ ■ O v,; O ■ ■ ■ O Vfc 

for any a £ R, any i £ {1, 2, ..., fc} and any (vi, ..., v,;, • • • , Vfc) £ Vi x • • • x 

Vfc. 


2 . 


vi 0 • • • O (v» + Wj) ® — (8) Vfc 
= vi <g) • • • 0 Vj 0 • • • <g) Vfc + vi <E> ■ ■ ■ <8> Wj (g) • • • (g) Vfc 

any i £ {1, 2 , ..., k} and for all choices of vi, ..., Vfc and w*. 

Thus is multilinear. 

Definition 26.61 The elements in the image of w, that is, elements which may 
be written as vj ® ® Vfc for some Vj, are called decomposable . 

Exercise 26.8 Not all elements are decomposable but the decomposable ele- 
ments generate Vi <S> ■ • ■ ® Vfc. 

It may be that the Vi may be modules over more that one ring. For example, 
any complex vector space is a module over both R and C. Also, the module of 
smooth vector fields %m(U) is a module over C°°(U) and a module (actually 
a vector space) over R. Thus it is sometimes important to indicate the ring 
involved and so we write the tensor product of two R-modules V and W as 
V ®r W. For instance, there is a big difference between Xm(U) %m(U) 

and X M (U) Or 

Now let O fc : Vi x • • • x Vfc ^ Vi O • • • O Vfc be natural map defined above 
which is the composition of the set injection Vi x • • • x Vfc ^ Fr(Vi x x Vfc) 
and the quotient map Fr(Vi x ••• xVfc) — > Vi ® • • • <g> Vfc. We have seen that 
this map actually turns out to be a multilinear map. 
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Theorem 26.19 Given modules Vi,...,Vfc, the space Vi ® • • • ® Vfc together 
with the map <S> k has the following universal property: 

For any k-multilinear map p : Vi x • • • x Vfc — > W, there is a unique linear 
map p : Vi ® • • • ® Vfc — > W called the universal map, such that the following 
diagram commutes: 

Vi x • • • x V fc W 

® k I /y 

Vi (8» • • • ® Vfc 


Thus the pair Vi ® • • • ® Vfc, <8* is universal for k - multilinear maps on Vi x 

• • • x V fc and by ?? if T, u is any other universal pair for ^-multilinear map we 
have Vi®- • -®Vfc = T. The module Vi®- • -®Vfc is called the tensor product 
of Vi,...,Vfc. 

Proof. Suppose that /r:ViX---xVfc— >IVis multilinear. Since, Fr(Vi x 

• • • x Vfc) is free there is a unique linear map M : Fr(Vi x • • • x Vfc) — > W 
such that M((vi, ..., Vfc)) = /r(vi, ..., Vfc). Clearly, this map is zero on ( B ) and 
so there is a factorization p of M through Vi ® • • • ® Vfc. Thus we always have 

ju(vi ® • • • ® Vfc) = M((vi, ..., Vfc)) = p{vi , ..., Vfc). 

It is easy to see that fi is unique since a linear map is determined by its action 
on generators (the decomposable elements generate Vi ® • • • ® Vfc) ■ 

Lemma 26.11 There are the following natural isomorphisms: 

1) (V ® W) ® U = V® (W® U) = V® (W® U) and under these isomorphisms 
(v ® w) ® u < — > V ® (w ® u) < — > V ® w ® u. 

i?)V®W = W®V and under this isomorphism v ® w < — > w ® v. 

Proof. We prove (1) and leave (2) as an exercise. 

Elements of the form (v ® w) ® u generate (V ® W) ® U so any map that 
sends (v ® w) ® u to v ® (w ® u) for all v, w, u must be unique. Now we have 
compositions 


(V x W) x U ®^ du (V®W)xU®(V»W)®U 

and 

V x (W x U) id ^® (VxW)®U*V®(W®U). 

It is a simple matter to check that these composite maps have the same universal 
property as the map VxWxU— >V®W®U. The result now follows from 
the existence and essential uniqueness results proven so far (?? and ??). ■ 

We shall use the first isomorphism and the obvious generalizations to identify 
Vi ® • • • ® Vfc with all legal parenthetical constructions such as (((Vi ® V 2 ) ® 
• • • ® Vj) ® • • • ) ® Vfc and so forth. In short, we may construct Vi ® • • • ® Vfc by 
tensoring spaces two at a time. In particular we assume the isomorphisms 


(Vi ® • • • ® Vfc) ® (Wi ® • • • ® Wfc) =* Vi ® • • • ® Vfc ® Wi ® • • • ® Wfc 
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which map (vi ® • • • 0 v*,) ® (wj ® • • • ® w*,) to vi 0 ■ • • ® vj,. ® wj ® ® w^. 

Consider the situation where we have module homomorphisms h t : W t — > Vj 
for 1 < * < to. We may then define a map T{h\, ..., h m ) : Wi 0 • • • 0 W m — » 
Vi 0 • • • 0 V m (by using the universal property again) so that the following 
diagram commutes: 

Vi x ••• x V m 
® fc | 

Vi ® • • • ® v m 

This is functorial in the sense that 

T(hi, ..., /i m ) ° T(gi, grn') — o gi 5 ••*) ^ t?m) 

and T(id, id) = id. Also, T(/ii, ..., /i m ) has the following effect on decompos- 
able elements: 


Wi x • • • x W„ 

® fc | 

Wi ® ® w„ 


hi X . . . X hn 


T(hi,...,hm) 


T(hi , ..., h m )(v i ® • • • <g> v m ) = /ii(vi) ® • • • ® /i m (v m ). 


Now we could jump the gun a bit and use the notation hi ® • • • ® /i m for 
T(hi , ..., h m ) but is this the same thing as the element of Hoiiir(Wi, Vi) 0 • • • 0 
HoniR(W m ,V m ) which must be denoted the same way? The answer is that 
in general, these are distinct objects. On the other hand, there is little harm 
done if context determines which of the two possible meanings we are invoking. 
Furthermore, we shall see than in many cases, the two meanings actually do 
coincide. 

A basic isomorphism which is often used is the following: 

Proposition 26.7 For R— modules W,V,U we have 

Hom R (W ® V, U) = L{ W, V; U) 


More generally, 


Hom R (Wi 0 • • • 0 W fe ,U) = L(Wi, W fe ; U) 

Proof. This is more or less just a restatement of the universal property of 
W 0 V. One should check that this association is indeed an isomorphism. ■ 

Exercise 26.9 Show that if W is free with basis (/i, ..., /„) then W* is also free 
and has a dual basis / that is, f z {fj) = Sj. 

Theorem 26.20 7/Vi,...,Vfc are free R— modules and if (vf, ...,v 3 n .) is a basis 
for Vj then set of all decomposable elements of the form vj 0 • • • 0 vj form a 
basis for Vi 0 • - • 0 Vfc . 
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Proposition 26.8 There is a unique R -module map t : WJ 0 ••• 0 Wj* — > 
(Wi 0 • • • 0 Wfc)* such that if a i 0 • • • 0 aa- is a (decomposable) element of 
WJ 0 • • • <s> W£ t/ien 

t(«i (g) - - - <S ) a fc )(tui (g) • • • 0 w fc ) = «i(wi) • • • Ofc(wfc)- 
If the modules are all free then this is an isomorphism. 


Proof. If such a map exists, it must be unique since the decomposable 
elements span W* 0 • • • 0 W£. To show existence we define a multilinear map 

0 : Wi x • • • x Wt. x Wi x • • • x W fe -> R 


by the recipe 

(ai,...,a k ,wi,...,w k ) e-> ai(toi) ■■■a k (w k ). 
By the universal property there must be a linear map 

0 : Wt 0 • • • 0 W£ 0 Wi 0 • • • 0 W fe R 

such that 0 o it = 0 where w is the universal map. Now define 


i(aii 0 • • • 0 Ofc)(wi 0 • • • 0 w k ) 

:= 0(aq 0 — 0 Ofc <S> Wi 0 ■ ■ ■ 0 tUfc). 


The fact, that l is an isomorphism in case the W, are all free follows easily from 
exercise ?? and theorem ??. Once we view an element of W, as a functional 
from W** = L( W*; R) we see that the effect of this isomorphism is to change 
the interpretation of the tensor product to the “map” tensor product in (Wi <g) 
• ••(g) Wfe)* = L( Wi 0 • • • g> Wfe; R). Thus the basis elements match up under i. 


Definition 26.62 The k-th tensor power of a module W is defined to be 

W® k := W 0 • • • 0 W. 

This module is also denoted (W). We also, define the space of ( r ) -tensors 
on W ; 

0 r (W) := W 0r 0 (W*) 0S . 

Similarly, ® g r (W) := (W*)® s 0 W® r is the space of ( g r )-tensors on W. 

Again, although we distinguish 0 r g (W) from ® r (W) we shall be able 
to develop things so as to use mostly the space ® ' s (W) and so by default we 
take 0^(W) to mean ® ' S (W). 

If (v\, v n ) is a basis for a module V and (u 1 , ..., v n ) the dual basis for V* 
then a basis for ®”(V) is given by 

{v il 0 • • • 0 Vi T 0 id 1 0 • • • 0 v Js } 
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where the index set is the set X(r, s, n) defined by 

I(r,s,n) := {(«i, ji, -,j s ) : 1 <i k <n and 1 < j k < n}. 

Thus 0j(V) has dimension n r n s (where n = dim(V)). 

We re-state the universal property in this special case of tensors: 

Proposition 26.9 (Universal mapping property) Given a module or vec- 
tor space V over R, then <g)^(V) has associated with it, a map 

<g>£ : V x V x • • • x V x V* x V* x • • • x V* -> <g£(V) 

r s 

such that for any multilinear map A £ T r S (V;R); 


A:VxVx---xVxV*xV*---xV*— >R 

' V ' ' v ' 

r— times s— times 

there is a unique linear map A : (g) r (V) — > R such that A o = A. Up to 
isomorphism, the space (g) ' (V) is the unique space with this universal mapping 

property. 

Corollary 26.3 There is an isomorphism (0 ' (V))* = T r s (V) given by 
A h Ao (g^. (Warning: Notice that the T r (V) occurring here is not the 
default space T r S (V) that we often denote by TJ(V).) 

Corollary 26.4 ((g) r g (V*))* =T/(N*) 

Now along the lines of the map of proposition ?? we have a homomorphism 

Z :<g> r s (V)-T r s (V) (26.7) 

given by 

ta((vi g • • • ® v r (g> r/ 1 g) • • • .. <g> ? 7 S ) ) (0 1 , ..., 9 r , Wi, .., w g ) 

= 0 i(vi) 0 2 (v 2 ) • • • 0 i(v;)? 7 1 (wi)? 7 2 (w 2 ) • ■■r) k (w k ) 


0 1 ,9 2 ,...,9 r £ V* and wi , w 2 , . . , w^ £ V. If V is a finite dimensional free module 
then we have V = V**. This is the reflexive property. We say that V is totally 
reflexive if the homomorphism ?? just given is in fact an isomorphism. This 
happens for free modules: 

Proposition 26.10 For a finite dimensional free module V we have a natural 
isomorphism (g) ' (V) = T r (V). The isomorphism is given by the map i r s 
(see ?? ) 
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Proof. Just to get the existence of a natural isomorphism we may observe 
that 


® r s (V) = ®\(V**) = (0 r s (V*))* 
= T/(V*) = L(V* r ,V** s ;R) 

= L(y* r , V s ; R) : =T r (V). 


We would like to take a more direct approach. Since V is free we may take 
a basis {/i, ..., /„} and a dual basis j/ 1 ,...,/"} for V*. It is easy to see that 
L r s sends the basis elements of ® r (V) to basis elements of T r (V; R) as for 
example 

4 : f ® fj ~ f ® fj 

where only the interpretation of the ® changes. ■ 

In the finite dimensional case, we will identify 0 r (V) with the space 
T r s (V;R) =L(V r *, V s ; R) of r, s- multilinear maps. We may freely think of a 
decomposable tensor vi ® ... ® v r ® r/ 1 ® ... ® if as a multilinear map by the 
formula 


(vi ® • • • ® v r ® r] 1 ® • • • .. <g> g s ) • (0 1 , ..., 0 r , wi, .., w s ) 

= 0i(vi)0 2 (v 2 ) • • • 0 /(v;)77 1 (wi)?7 2 (w 2 ) • • • T] k (w k ) 

U) of smooth vector fields over an open set U in some manifold M. We shall 
see that for finite dimensional manifolds TJ(f£(f7)) is naturally isomorphic to 
the smooth sections of a so called tensor bundle. R— Algebras 

Definition 26.63 Let R be a commutative ring. An R~algebra% is an R —module 
that is also a ring with identity la where the ring addition and the module ad- 
dition coincide; and where 

1) r(aia 2 ) = (rai)a 2 = ai(ra 2 ) for all ai,a 2 £ 21 and all re R. 

2) (rir 2 )(aia 2 ) = (riai)(r 2 a 2 ). 

If we also have (aia 2 )a 3 = ai(a 2 a 3 ) for all ai,a 2 ,a ,3 £ 21 we call 21 an 
associative R— algebra. 

Definition 26.64 Let 21 and 03 be R —algebras. A module homomorphism h : 

2t — » 03 that is also a ring homomorphism is called an R— algebra homo- 
morphism. Epimorphism, monomorphism and isomorphism are defined in the 
obvious way. 

If a submodule 3 of an algebra 21 is also a two sided ideal with respect to 
the ring structure on 21 then 21/3 is also an algebra. 

Example 26.20 The set of all smooth functions C°°(U ) is an R — algebra (R 
is the real numbers) with unity being the function constantly equal to 1. 

Example 26.21 The set of all complex n x n matrices is an algebra over C 
with the product being matrix multiplication. 
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Example 26.22 The set of all complex n x n matrices with real polynomial 
entries is an algebra over the ring of polynomials R[x]. 

Definition 26.65 The set of all endomorphisms of an R— module W is an 
R— algebra denoted End^pW) and called the endomorphism algebra of W. 
Here, the sum and scalar multiplication is defined as usual and the product is 
composition. Note that for re R 

r(f o g) = (rf) o g = f o (■ rg ) 

where f,g e EndR{W). 

Definition 26.66 A Z -graded R-algebra is an R -algebra with a direct sum de- 
composition 21 = Yliai 21 i such that 21,21/ C 21;+ j . 

Definition 26.67 Let 21 = JA gZ 2t; an d 23 = ^e graded algebras. 

An R-algebra homomorphism h : 21 — > 23 is a called a Z -graded homomorphism 
if /i(2lj) C 23 i for each i £ Z. 

We now construct the tensor algebra on a fixed R— module W. This algebra is 
important because is universal in a certain sense and contains the symmetric and 
alternating algebras as homomorphic images. Consider the following situation: 
21 is an R— algebra, W an R— module and <p : W — > 21 a module homomorphism. 
If h : 21 — > 23 is an algebra homomorphism then of course h o cp : W — ► 23 is an 
R— module homomorphism. 

Definition 26.68 Let W be an R— module. An R— algebra if together with a 
map <f> : W — > it is called universal with respect to W if for any R— module 
homomorphism ip : W — > 23 there is a unique algebra homomorphism If h : if — > 
23 such that, ho cp — ip. 

Again if such a universal object exists it is unique up to isomorphism. We 
now exhibit the construction of this type of universal algebra. First we define 
T°(W) := R and T X (W) := W. Then we define T k { W) := W fc ® = W <g> • • • ® W. 
The next step is to form the direct sum T( W) := cPqT 1 (W). In order to 
make this a Z-graded algebra we define T*(W) := 0 for i < 0 and then define a 
product on T(W) := ie z.T l (W) as follows: We know that for i,j >0 there is 
an isomorphism W 1 ® 0 W- 7 ® — > W^ +J 2® and so a bilinear map W l ® x W J ® — » 
such that 

W1 ® • • • 0 Wi X W 'i ® • • • 0 Wj e- > Wi 0 ■ ’ ■ 0 Wi 0 W/ ® • • ’ 0 Wj ■ 

Similarly, we define T°( W) x W 1 ® = R x W 1 ® — » W ! ® by just using scalar 
multiplication. Also, W ! ® x W- 7 ® — > 0 if either i or j is negative. Now we may 
use the symbol 0 to denote these multiplications without contradiction and put 
then together to form an product on T(W) := *6 zT l (W). It is now clear 

that T 4 ( W) x T J ’(W) i— > T ? (W) ® T- 7 (W) C T I+J (W) where we make the needed 
trivial definitions for the negative powers T*( W) = 0, * < 0. Thus T(W) is a 
graded algebra. 
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26.12.1 Smooth Banach Vector Bundles 

The tangent bundle and cotangent bundle are examples of a general object 
called a (smooth) vector bundle which we have previously defined in the finite 
dimensional case. As a sort of review and also to introduce the ideas in the case 
of infinite dimensional manifolds we will define again the notion of a smooth 
vector bundle. For simplicity we will consider only C°° manifold and maps in 
this section. Let E be a Banach space. The most important case is when E is a 
finite dimensional vector space and in that case we might as well take E= K”. 
It will be convenient to introduce the concept of a general fiber bundle and then 
specialize to vector bundles. The following definition is not the most efficient 
logically since there is some redundancy built in but is presented in this form 
for pedagogical reasons. 

Definition 26.69 Let F be a smooth manifold modelled on F. A smooth fiber 
bundle £ = (E, tte, M, F) with typical fiber F consists of 

1 ) smooth manifolds E and M referred to as the total space and the base 
space respectively and modelled on Banach spaces M x F and M respectively; 

2) a smooth surjection tte ■ E — > M such that each fiber E x = ir~ l {x} is 
diffeomorphic to F; 

3) a cover of the base space M by domains of maps <f> a : Ejj a := tt g (U a ) — > 
U a x F, called bundle charts, which are such that the following diagram com- 
mutes: 

E Ua 1 ^ U a x F 

tte \ y 

u a 

Thus each <p a is of the form {itu a > 3> a ) where i Tjj a := nE\U a and d 1 ,-, : Ejj a — > 
F is a smooth submersion. 

Definition 26.70 The family of bundle charts whose domains cover the base 
space of a fiber bundle as in the above definition is called a bundle atlas. 

For all x £ U a , each restriction Q a ,x '■= < I> Q ,| £; is a diffeomorphism onto 
F. Whenever we have two bundle charts <j> a = ( 7 T£;, 4 > q ) and <j)p = (tte,^p) 
such that U a nUp 0 then for every x £ U a D Up we have the diffeomorphism 
< F q / 3 ] x = o : F — * F. Thus we have map g a p : U a fl Up — * Diff(F) 

given by g a p{x) := 4 > a/ 3 iX . Notice that gp a o g~ ^ = id. The maps so formed 
satisfy the following cocycle conditions: 

$ 7/3 ° 9a 7 = 9 aft whenever U a fiUp fl t / 7 0. 

Let £ be as above and let U be open in M. Suppose we have a smooth map 
<f> : Ejj — > U x F such that the diagram 

E v 4 U x F 
ku \ / 

U 
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where Ejj := t^e (U) as before. We call <f> a trivialization and even if 4> was 
not one of the bundle charts of a given bundle atlas, it must have the form 
(tte, 4?) and we may enlarge the atlas by including this map. We do not wish to 
consider the new atlas as determining a new bundle so instead we say that the 
new atlas is equivalent. There is a unique maximal atlas for the bundle which 
just contains every possible trivialization. 

Now given any open set U in the base space M of a fiber bundle £ = 

( E , 7 te, M, F) we may form the restriction (\ U which is the fiber bundle (n^. (U), ee\ U, U, F). 
To simplify notation we write Ejj := 77 ^ (17) and tte\ U := njj. This is a special 
case of the notion of a pull back bundle. 

One way in which vector bundles differ from general fiber bundles is with 
regard to the existence on global sections. A vector bundle always has at least 
one global section. Namely, the zero section 0 e '■ M —> E which is given by 
x e- > 0 X G E x . Our main interest at this point is the notion of a vector bundle. 

Before we proceed with our study of vector bundles we include one example of 
fiber bundle that is not a vector bundle. 

Example 26.23 Let M and F be smooth manifolds and consider the projection 
map pr i : M x F — > M. This is a smooth fiber bundle with typical fiber F and 
is called a product bundle or a trivial bundle. 

Example 26.24 Consider the tangent bundle tm '■ TM — > M of a smooth 
manifold modelled on R". This is certainly a fiber bundle (in fact, a vector 
bundle) with typical fiber R™ but we also have the bundle of nonzero vectors 
7 r : TAI X — > M defined by letting TM X := {i> G TM : v 0} and i r := tm|tm x • 

This bundle may have no global sections. 

Remark 26.14 A “ structure ” on a fiber bundle is determined by requiring that 
the atlas be paired down so that the transition maps all have values in some 
subset G (usually a subgroup) of Diff(E). Thus we speak of a G— atlas for 
£ = (E,tte, M, F). In this case, a trivialization (tte,&) '■ Ejj — > U x F is 
compatible with a given G— atlas A(G) if €= G and 4>, r o <I ) ~ l x G G for all 

(tte,^o) G A(G). The set of all trivializations (bundle charts) compatible which 
a given G— atlas is a maximal G— atlas and is called a G -structure. Clearly, 
any G— atlas determines a G-structure. A fiber bundle £ = (E,tte, M, F) with 
a G-atlas is called a G-bundle and any two G-atlases contained in the same 
maximal G-atlas are considered equivalent and determine the same G-bundle. 

We will study this idea in detail after we have introduced the notion of a Lie 
group. 

We now introduce our current object of interest. 

Definition 26.71 A (real) vector bundle is a fiber bundle £ = ( E,tte,M , E) 
with typical fiber a (real) Banach space E such that, for each pair of bundle chart 
domains U a and Up with nonempty intersection, the map 

gap • X 1 - *&afi,x • 4 * a,x ^ 
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is a C°° morphism g a p : U a fl Up — > GL (£). If E is /znite dimensional, say 
E = K", i/ien we say that £ = (E, 7Te,M, M") has rank n. 

So if G 7r^ 1 (a;) C E Ua then < p a (v x ) = (x, <I>a,x(vx)) for 4> a>;E : E x — >E 
a diffeomorphism. Thus we can transfer the vector space structure of V to 
each fiber E x in a well defined way since $ a ,x o <[>g x G GL(E) for any x in 
the intersection of two VB-chart domains U a and Up. Notice that we have 
4> a ° < i>p(x,v ) = (. x,g a p(x ) • v) where g a p : U a fl Up — » GL(E) is differentiable 
and is given by g a p{x) = o Notice that g a p{x ) G GL(E). 

A complex vector bundle is defined in an analogous way. For a complex 
vector bundle the typical fiber is a complex vector space (Banach space) and 
the transition maps have values in GL(E;C). 

The set of all sections of real (resp. complex) vector bundle is a vector 
space over R. (resp. C) and a module over the ring of smooth real valued (resp. 
complex valued) functions. 

Remark 26.15 If E is of finite dimension then the smoothness of the maps 
g a p : U a fl Up — > GL(£) is automatic. 

Definition 26.72 The maps g a p{x) '■= < f > a>a; o<f)^ x are called transition maps. 
The transition maps always satisfy the following co cycle condition: 
g -,/3 (x) o gp a {x) = g ia {x) 

In fact, these maps encode the entire vector bundle up to isomorphism: 

Remark 26.16 The following definition assumes the reader knows the defini- 
tion of a Lie group and has a basic familiarity with Lie groups and Lie group 
homomorphisms. We shall study Lie groups in Chapter ??. The reader may 
skip this definition. 

Definition 26.73 Let G be a Lie subgroup o/GL(£). We say that tte ■ E — > M 
has a structure group G if there is a cover by trivializations (vector bundle 
charts) <f a : Ejj a — > U a xE such that for every non-empty intersection U a fl Up, 
the transition maps g a p take values in G. 

Remark 26.17 Sometimes it is convenient to define the notion of vector bundle 
chart in a slightly different way. Notice that U a is an open set in M and 
so (f a is not quite a chart for the total space manifold E. But by choosing a 
possibly smaller open set inside U a we may assume that U a is the domain of an 
admissible chart U a ,il> a for M. Then we can compose to get a map tj> a : Ejj a —> 
’f , a(U a )x E. The maps now can serve as admissible charts for the differentiable 
manifold E. This leads to an alternative definition of VB-chart which fits better 
with what we did for the tangent bundle and cotangent bundle: 
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Definition 26.74 (Type II vector bundle charts) A (type II) vector bun- 
dle chart on an open set V C E is a fiber preserving diffeomorphism <p : V — » 
OxE which covers a diffeomorphism <f>:% te(V) — > O in the sense that the follow- 
ing diagram commutes 

V 4 OxE 
ke 1 I Wi 

7 te(V) — > O 

<t> 


and which is a linear isomorphism on each fiber. 

Example 26.25 The maps Tip a : TU a — > ip a (U a ) x E are (type II) VB-charts 
and so not only give TM a differentiable structure but also provide TM with a 
vector bundle structure. Similar remarks apply for T*M. 

Example 26.26 Let E be a vector space and let E = AIxE. The using the 
projection pr± : MxE — > M we obtain a vector bundle. A vector bundle of this 
simple form is called a trivial vector bundle. 

Define the sum of two section Si and S 2 by (si + S 2 )(p) '■= sffp) + S 2 (p). 
For any f g C°°(U) and s € T(U,E) define a section fs by ( fs)(p ) = f(p)s(p). 
Under these obvious definitions T(U,E) becomes a C ,oc ([/(-module. 

The the appropriate morphism in our current context is the vector bundle 
morphism: 

Definition 26.75 Definition 26.76 Let (E,tte, M) and N ) be vector 

bundles. A vector bundle morphism (E,tte,M) — > (F,ttf,N) is a pair of 
maps f : E — »■ F and fo'-M^N such that 

1. Definition 26. 77 1) The following diagram commutes: 


f 




E 

F 


KE 

1 

1 Ep 



M 

N 




fo 

and f\ E is a 

continuous linear map from E p into Ff 0 r p ) for each p € M. 

2) For each Xq £ M there 

exist 

VB-charts (7 te,4>) : Ejj — >■ UxE and 

(n E ,& a ) ■ Fiji 

— > U'xE with Xq 

g U and fo(U) C V such that 


X 1— > 


^ 0 ^ 0 *\e x 


is a smooth map from U into GL(E.E'). 
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Notation 26.6 Each of the following is a valid way to refer to a vector 
bundle morphism: 

1) (/, / 0 ) : (E,tt e ,M,E) -+ (F,n F ,N, F) 

2) f : (E,tte, M) — » (F,tt f ,N) (the map fo is induced and hence under- 
stood) 

3) f : f i — > £2 (this one is concise and fairly exact once it is set down that 
£1 = (£ 1 , 71 - 1 , M) and £> = ( E 2 ,n 2 ,M ) 

4) f n F 

5) E -4 F 

Remark 26.18 There are many variations of these notations in use and the 
reader would do well to get used to this kind of variety. Actually, there will be 
much more serious notational difficulties in store for the novice. It has been said 
that notation is one of the most difficult aspects of differential geometry. On the 
other hand, once the underlying geometric picture has been properly understood 
, one may “see through” the notation. Drawing diagrams while interpreting 
equations is often a good idea. 

Definition 26.78 Definition 26.79 If f is an (linear) isomorphism on each 
fiber E p then we say that f is a vector bundle isomorphism and the two 
bundles are considered equivalent. 

Notation 26.7 If f is a VB morphism from a vector bundle tte : E — * M to 
a vector bundle n F : F — ► M we will sometimes write this as f : tte — > 7 x F or 
1 

TTe — > 7 Tp- 

Definition 26.80 A vector bundle is called trivial if there is a there is a vector 
bundle isomorphism onto a trivial bundle: 


E -> MxE 

7 r E \ >/ pri 

M 


Now we make the observation that a section of a trivial bundle is in a sense, 
nothing more than a vector- valued function since all sections s £ E) 

are of the form p — > (p, f(p)) for a unique function / € C°°(M, E). It is common 
to identify the function with the section. 

Now there is an important observation to be made here; a trivial bundle 
always has a section which never takes on the value zero. There reason is that 
we may always take a trivialization f> : E — > M x E and then transfer the obvious 
nowhere-zero section p 1 — > (p, 1) over to E. In other words, we let si : M — > E be 
defined by si(p) = 1). We now use this to exhibit a very simple example 

of a non-trivial vector bundle: 
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Example 26.27 (Mobius bundle) Let E be the quotient space of [0, 1] x I 
under the identification of (0,f) with (1, —t). The projection [0, 1] x 1 — >[0, 1] 
becomes a map E — > S 1 after composition with the quotient map: 

[0,1] xl -► [0,1] 

I I 

E -* S 1 

Here the circle arises as [0, 1]/ ~ where we have the induced equivalence relation 
given by taking 0 ~ 1 in [0, 1]. The familiar Mobius band has an interior which 
is diffeomorphic to the Mobius bundle. 

Now we ask if it is possible to have a nowhere vanishing section of E. It is 
easy to see that sections of E correspond to continuous functions f : [0, 1] — » R. 
such that /( 0) = — /( 1). But then continuity forces such a function to take 
on the value zero which means that the corresponding section of E must vanish 
somewhere on S 1 = [0,1]/ Of course, examining a model of a Mobius band 
is even more convincing; any nonzero section of E could be, if such existed, 
normalized to give a map from S 1 to the boundary of a Mobius band which only 
went around once, so to speak, and inspection of a model would convince the 
reader that this is impossible. 

Let = (Ei, 7Ti, M, Ei) and £ 2 = (E 2 , 7r 2 , M, E 2 ) be vector bundles locally 
isomorphic to Mx Ei and Mx E 2 respectively. We say that the sequence of 
vector bundle morphisms 

0 — + Ci £2 

is exact if the following conditions hold: 

1. There is an open covering of M by open sets U a together with trivializa- 

tions 4>i >a : — > U a x Ei and </> 2jQ , : tt f (U a ) — > U a x E 2 such that 

E 2 = Ei x F for some Banach space F; 

2. the diagram below commutes for all a : 

nf\U a ) - TTf\U a ) 

01, a I I 02, a 

U a X El — > U a x Ei x F 

Definition 26.81 A subbundle of a vector bundle f = ( E,n,M ) is a vector 
bundle of the form f = ( L , 7t| l , M) where n\ L is the restriction to L C E, and 
where L C E is a submanifold such that 

0 

is exact. Here, £| L — > f is the bundle map given by inclusion: L E . 

Equivalently, : n\ L : L — > M is a subbundle if L C E is a submanifold and 
there is a splitting E = Ei x F such that for each p £ M there is a bundle chart 
4> : 7r _1 f7 — > U x E with p £ U and (^((n^ 1 !!) n L) = U x Ei x {0}. 
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Definition 26.82 The chart <fi from the last definition is said to be adapted to 
the subbundle. 

Notice that if L C E is as in the previous definition then 7 t| l : L — > M is a 
vector bundle with VB-atlas given by the various V B-charts U, <j> restricted to 
(7T” 1 B)n5' and composed with projection U xEiXjO} — xEj so x\ L is a bundle 
locally isomorphic to M x Ei. The hber of n\ L at p £ L is L. p = E p (~l L. Once 
again we remind the reader of the somewhat unfortunate fact that although 
the bundle includes and is indeed determined by the map n\ L : L — > M we 
often refer to L itself as the subbundle. In order to help the reader see what 
is going on here lets us look at how the definition of subbundle looks if we are 
in the finite dimensional case. We take M = E = R m and Ei x F is the 
decomposition K” = x R m-fc . Thus the bundle n : E — > M has rank m (i.e. 
the typical fiber is K m ) while the subbundle ir\ L : L —> AI has rank k. The 
condition described in the definition of subbundle translates into there being a 
VB-chart <f> : ir~ 1 U — » U x R fc x W n ~ k with (/>((7r _1 f/) n L) = U x x {0}. 
What if our original bundle was the trivial bundle pr\ : U x R m — > [7? Then 
the our adapted chart must be a map U x R m — > U x R k x R m-fc which must 
have the form (x,v) i— > (x,f(x)v, 0) where for each x the f(x) is a linear map 
R m -> R k . 

26.12.2 Formulary 

We now define the pseudogroup (s) relevant to the study of foliations. Let M = 
E x F be a (split) Banach space. Define T m,f to be the set of all diffeomorphisms 
from open subsets of E x F to open subsets of E x F of the form 

$(*,y) = (/(x,y),s(y))- 

In case M is n dimensional and M = K™ is decomposed as R k x K 9 we write 
Tm.f = r n>g . We can then the following definition: 

Definition 26.83 A Tm.f structure on a manifold M modelled on M = E x F 
is a maximal atlas of charts satisfying the condition that the overlap maps are 
all members ofT m,f- 

1) i [x,v] = C-x ° ty + ty ° £-x 

2) Cfxx> = fCxOJ + df A lxoj for all u> € f 2(M) 

3) d(a A 0) = (da) A (3 + (— 1 ) k a A (dj3) 

4) f t Fl? *Y = FI? *(L X Y) 

5) [Ifl = EV, ) S 

Example 26.28 (Frame bundle) Let M be a smooth manifold of dimension 
n. Let F X (M) denote the set of all bases (frames) for the vector space T X M . 
Now let F(M) := [j xeM F X (M). Define the natural projection tt : F(M) — ■> M 
by 7r(f) = x for all frames f = (ff) for the space T X M . It can be shown that 
F(M) has a natural smooth structure. It is also a GL(n,M.) -bundle whose typical 
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fiber is also GL(n, R). The bundle charts are built using the charts for M in the 
following way: Let U a ,xl> a be a chart, for M. Any frame f = (ff) at some point 
x £ U a may be written as 


= ajj 


We then map f to ( x , (cj)) £ U a xGL(n,,R). This recipe gives a map n 1 (U a ) 
U a x GL(?r, R) which is a bundle chart. 


Definition 26.84 A bundle morphism (/, /o) : £i — s > £2 from one fiber bundle 
£1 = (Ei,tt E i , Mi, Fi) to another & = (E 2 , 7Te 2 , M 2 , F 2 ) is a pair of maps (/, /o) 
such that the following diagram commutates 


£1 A E 2 

7T Ei I n E2 I 

Mi * M 2 


In case M\ = AI 2 and /o = id m we call f a strong bundle morphism. In 
the latter case if f : E\ — > E 2 is also a diffeomorphism then we call it a bundle 
isomorphism. 


Definition 26.85 Let £1 and £2 be fiber bundles with the same base space M. 

If there exists a bundle isomorphism (/, Mm) : £1 — > £2 we say that £1 and £2 

fib 

are isomorphic as fiber bundles over M and write £1 = f 2 - 

Now given any open set U in the base space M of a fiber bundle £ = 

( E , 7 te, M , F) we may form the restriction £| U which is the fiber bundle (7Tg 1 (C/), tte\ U, U, F). 
To simplify notation we write Ejj := and tte\U := 7 t e . 

Example 26.29 Let M and F be smooth manifolds and consider the projection 
map pr 1 : M x F — > M . This is a smooth fiber bundle with typical fiber F and 
is called a product bundle or a trivial bundle. 

A fiber bundle which is isomorphic to a product bundle is also called a 
trivial bundle. The definition of a fiber bundle f with typical fiber F includes 
the existence of a cover of the base space by a family of open sets {U a } a& A such 

fib 

that £| U a = U x F for all a £ A. Thus, fiber bundles as we have defined them, 
are all locally trivial. 

Misc 

1— form 9 = fA e j9 l which takes any vector to itself: 

6 ( v p) = ^2 e j(pW( v p) 

= ^2 vte j(p) = v p 
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Let us write d w 9 = \ e k 0 T k j9 l A 9^ — \ J2 e k 0 r k . If V is the Levi Civita 
connection on M then consider the projection P A : E ®TM 0 T*M given by 
P A T(£, v) = T(£, v) - T(v, 0- We have 

Vej = u k e k = eco 

V6 j = —co J k 9 k 

V^(ej 0 9 j ) 

P A (V ? ^)(u) = -ul(O0 k {v) + tji(v)9 k (0 = -ul A 9 k 
Let T(£,v) = V{(ej 0 0 J ')(u) 

= (V ? ej) 0 6 j (v) + e* 0 (V 5 0 J )(n) = u k (^)e k 0 0 J '(u) + e* 0 (-a4(£)0 fe (u)) 
= Ui(Q e k ® + e* 0 (-u!f(€)0 j (v)) = e fc 0 (wf (0 - w$(£))9 j (v) 

Then 


(P A T)(e,n)=T(? ) n)-T(n,e) 

= (Vej) A + ej 0 
= d v (ej 0 


d v 6> = d v J2 e i 9j 

= E^ ,) Ad J • , 0d0* (26.8) 

= E(E e k 0 u>j) A + E e*; (8) dO k 

k 

= E e fc®(E^ A ^+ d0fc ) 

fc j 

So that J2j u k A Qi + d9 k = \ r k . Now let a = /^’ej be a vector field 

d v d v a = d v (d v Cjf j ) = d V (E( Ve f )/ J + E e i 0 d f j ) 

(E(Ve,)^ + E' : ' /V V0)/'' + E Vejd/-' + ej 0 ddfi^J 

Yr^v ej )^Yf j 

So we seem to have a map / J ej i— > il k f :l e k . 

e r flj = d v Vej = d v (e k u k ) 

= Vefc A Wj + e k du> k 
= e r u> k A Wj + e k duj k 
= c r ui k A oj k T € r dt0j 
= e r (dujj + u> k A Wj ) 
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ci v Ve = d v (ew) = Ve A oj + echo 

From this we get 0 = d(A~ 1 A)A~ X = {dA~ 1 )AA~ 1 + A -1 cL4A _1 
dA- 1 = A~ 1 dAA~ 1 

= du] + co r k A ujj 
Sl = du + wAu 
Q — duj oj A uj 

Q' = d (A-'wA + A~ l dA) + (A-'utA + A~ 1 dA) A (A~ 1 toA + A~ x dA) 

— d ( A 4.) 4- d (A ^ dA^ 4- A ^co A ujA 4- A A ciA 
+ A~ 1 dAA~ 1 Awi+ A _1 dA A A~ 1 dA 

— d (A ^tuA) 4~ dA ' A dA 4- A A tuA 4- A A dA 
+ A~ 1 dAA~ 1 ujA + A~ 1 dA A A _1 dA 

= dA _1 wA 4- A _1 dwA — A~ l u;dA + dA _1 A dA 4- A _1 w A w4 4- A _1 w A dA 
+ A~ 1 dAA~ 1 A w4 4- A _1 dA A A _1 dA 
= A ^dcuA -4 A cu A cuA 
fi' = A _1 flA 

lu' = A~ 1 ojA + A~ x dA 

These are interesting equations let us approach things from a more familiar 
setting so as to interpret what we have. 


26.13 Curvature 

An important fact about covariant derivatives is that they don’t need to com- 
mute. If a : M —> E is a section and X £ £(M) then Vx<t is a section 
also and so we may take it’s covariant derivative VyVxc r with respect to some 
Y £ £(M). In general, VyVx a ^ VxVycr and this fact has an underly- 
ing geometric interpretation which we will explore later. A measure of this 
lack of commutativity is the curvature operator which is defined for a pair 
X, Y £ X(M) to be the map F(X, Y) : T(E) — » r(.E) defined by 

F(X, Y)a := Vx Vycr — Vy Vjxcr — V 
or 

[Vx,Vy]a - Vpqy-jer 


26.14 Group action 

p : G x M — > M 

p(g , x) as gx then require that 
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Exercise 26.10 1 ) g-\ (g-ix) = (gig 2 )% for all gi,g 2 €E G and for all x € M 

2) ex = x for all x £ M, 

3) the map ( x , g) 


26.15 Notation and font usage guide 


Category 

Space or object 

Typical elements 

Typical morphisms 

Vector Spaces 

V , W , R n 

v,w, x,y 

A,B, K,X,L 

Banach Spaces 

E.F.M.N.VAV, R" 

v,w,x,y etc. 

A, B, K, A, L 

Open sets in vector spaces 

u, V, O, u a 

p,q,x,y,v,w 

fi9,P,' t P 

Differentiable manifolds 

M,N,P,Q 

P,Q,x,y 

f,g,v,if 

Open sets in manifolds 

u,v,o,u a 

p, q,x,y 

f,_g i p, if, 

Bundles 

E —> M 

v,w,^p,q,x 

(/,/)> ( g,id),h 

Sections of bundles 

T(M, E) 

S, Si , (J, ... 

f* 

Sections over open sets 

T(U,E)=S*(U) 

S, Si , (J, ... 

f* 

Lie Groups 

G, H, K 

g,h,x,y 

h, f,g 

Lie Algebras 

0,M, a, b 

v,x,y,z,£ 

h, g , df, dh 

Fields 

F,R,C,K 

t,s,x,y,z,r 

f,g,h 

Vector Fields 

X m (U),X(M) 

X,Y,Z 

/*,/.?? 


So, as we said, after imposing rectilinear coordinates on a Euclidean space 
E n (such as the plane E 2 ) we identify Euclidean space with R", the vector space 
of n— tuples of numbers. We will envision there to be a copy R^ of R" at each of 
its points p £ R". The elements of R” are to be thought of as the vectors based 
at p, that is, the “tangent vectors”. These tangent spaces are related to each 
other by the obvious notion of vectors being parallel (this is exactly what is not 
generally possible for tangents spaces of a manifold). For the standard basis 
vectors e 7 (relative to the coordinates xf) taken as being based at p we often 

write and this has the convenient second interpretation as a differential 

Xi p 

operator acting on C°° functions defined near p £ R”. Namely, 


d 

dxi 


f 

p 



An n-tuple of G°°functions X 1 , ...,X n defines a C°° vector field X = 
whose value at p is J2X l (p) ■ Thus a vector field assigns to each p in its 

domain, an open set U, a vector 'f2X l ( y p ) at p. We may also think of 

X% p 

vector field as a differential operator via 


/ i— > Xf G C°°(U) 

(Xf)(p) 

Example 26.30 X = —x-f^ is a vector field defined on U = R 2 — {0} and 

(Xf)(x,y) = y^(x, y) - f -(x,y). 
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Notice that we may certainly add vector fields defined over the same open 
set as well as multiply by functions defined there: 


(. fX + gY)(p) = f(p)X(p) + g(p)X(p) 

The familiar expression df = + • • • + J^-dx n has the intuitive inter- 

pretation expressing how small changes in the variables of a function give rise 
to small changes in the value of the function. Two questions should come to 
mind. First, “what does ‘small’ mean and how small is small enough?” Second, 
“which direction are we moving in the coordinate” space? The answer to these 
questions lead to the more sophisticated interpretation of df as being a linear 
functional on each tangent space. Thus we must choose a direction v p at p £ R™ 
and then df(v p ) is a number depending linearly on our choice of vector v p . The 
definition is determined by dxt(ej) = Sij. In fact, this shall be the basis of our 
definition of df at p. We want 


Df 


x 5 / 

) : = d£ t (p) - 


Now any vector at p may be written v p = YAi = i yl 
use v p as a differential operator (at p): 


-JX which invites us to 

x ' p 


v P f : = 

i= 1 



ip) e R 


This consistent with our previous statement about a vector field being a differ- 
ential operator simply because X ( p ) = X p is a vector at p for every p £ U. This 
is just the directional derivative. In fact we also see that 


D f\ P i v P ) = £ 

3 3 



d_ 

dxi 


= £ 



ip) = Vpf 


so that our choices lead to the following definition: 


Definition 26.86 Let f be a C°° function on an open subset U o/M". By the 
symbol df we mean a family of maps Df | with p varying over the domain U 
of f and where each such map is a linear functional of tangent vectors based at 
p given by Df \ (v p ) = v p f = £?=! v*§f ip). 


Definition 26.87 More generally, a smooth l-form a on U is a family of linear 


functionals a p : T p . 


with p £ U which is smooth is the sense that 


a pi A I" ) a smooth function of p for all i. 
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From this last definition it follows that if X = X 1 is a smooth vector field 
then ot(X)(p) a p (X p ) defines a smooth function of p. Thus an alternative 
way to view a 1— form is as a map a : X i— > a(X) which is defined on vector 
fields and linear over the algebra of smooth functions C°°(U ) : 

a(fX + gY) = fa(X) + ga(Y). 


Fixing a problem. It is at this point that we want to destroy the privilege 
of the rectangular coordinates and express our objects in an arbitrary coordinate 
system smoothly related to the existing coordinates. This means that for any 
two such coordinate systems, say it 1 , ...,u n and y 1 , ....,y n we want to have the 
ability to express fields and forms in either system and have for instance 




(y) 


% 


= X = XI 


_d_ 


for appropriate functions , X) ,. This equation only makes sense on the 
overlap of the domains of the coordinate systems. To be consistent with the 
chain rule we must have 

d did d 

dy l dy l dui 

which then forces the familiar transformation law: 


E 


dui 

dy i 


\rl -y% 

X {y) ~ A 0) 


We think of X(^ and X 1 , . as referring to, or representing, the same geometric 
reality from the point of view of two different coordinate systems. No big deal 
right? Well, how about the fact that there is this underlying abstract space 
that we are coordinatizing? That too is no big deal. We were always doing it 
in calculus anyway. What about the fact that the coordinate systems aren’t 
defined as a 1-1 correspondence with the points of the space unless we leave 
out some points in the space? For example, polar coordinates must exclude 
the positive x-axis and the origin in order to avoid ambiguity in 9 and have a 
nice open domain. Well if this is all fine then we may as well imagine other 
abstract spaces that support coordinates in this way. This is manifold theory. 
We don’t have to look far for an example of a manifold other than Euclidean 
space. Any surface such as the sphere will do. We can talk about 1-forms like 
say a = 6d<j)+<j) sm(6)d0, or a vector field tangent to the sphere 9 sin (</>) -§g+9 2 -§^ 
and so on (just pulling things out of a hat). We just have to be clear about 
how these arise and most of all how to change to a new coordinate expression 
for the same object. This is the approach of tensor analysis. An object called a 
2-tensor T is represented in two different coordinate systems as for instance 


V T ij — 
f ^ (y) Qy'i 


d 

dyi 


sr T ij 

(«) Qyl 


d 

dui 
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where all we really need to know for many purposes the transformation law 

T ij _ sr T rs d v l d y l 

iv) (U) g u r Q u s ' 

r,s 

Then either expression is referring to the same abstract tensor T. This is just 
a preview but it highlight the approach wherein a transformation laws play a 
defining role. 

In order to understand modern physics and some of the best mathematics it is 
necessary to introduce the notion of a space (or spacetime) which only locally has 
the (topological) features of a vector space like R n . Examples of two dimensional 
manifolds include the sphere or any of the other closed smooth surfaces in R 3 
such a torus. These are each locally like R 2 and when sitting in space in a nice 
smooth way like we usually picture them, they support coordinates systems 
which allow us to do calculus on them. The reader will no doubt be comfortable 
with the idea that it makes sense to talk about directional rates of change in 
say a temperature distribution on a sphere representing the earth. 

For a higher dimensional example we have the 3— sphere S 3 which is the 
hypersurface in R 4 given by the equation x 2 + y 2 + z 2 + w 2 = 1. 

For various reasons, we would like coordinate functions to be defined on 
open sets. It is not possible to define nice coordinates on closed surfaces like the 
sphere which are defined on the whole surface. By nice we mean that together 
the coordinate functions, say, 9, <f> should define a 1-1 correspondence with a 
subset of R 2 which is continuous and has a continuous inverse. In general the 
best we can do is introduce several coordinate systems each defined on separate 
open subsets which together cover the surface. This will be the general idea for 
all manifolds. 

Now suppose that we have some surface S and two coordinate systems 

(9, <t>):U i -► R 2 
(u, v) : U2 —> R 2 

Imagine a real valued function / defined on S (think of f as a temperature or 
something). Now if we write this function in coordinates (9,<j)) we have / rep- 
resented by a function of two variables fi(9, </>) and we may ask if this function 
is differentiable or not. On the other hand, / is given in (u,v) coordinates by 
a representative function f2(u,v). In order that our conclusions about differen- 
tiability at some point p € U\ flt/ 2 C S should not depend on what coordinate 
system we use we had better have the coordinate systems themselves related 
differentiably. That is, we want the coordinate change functions in both direc- 
tions to be differentiable. For example we may then relate the derivatives as 
they appear in different coordinates by chain rules expressions like 

Ofi _ d /2 du df 2 dv 
09 ~ du 09 + Ov 09 

which have validity on coordinate overlaps. The simplest and most useful con- 
dition to require is that coordinates systems have C°° coordinate changes on 
the overlaps. 
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Definition 26.88 A set M is called a C°° differentiable manifold of dimension 
n if M is covered by the domains of some family of coordinate mappings or 

charts {x a : U a — > where x a = (x^,x^, x”). We require that the 

coordinate change maps xp o x” 1 are continuously differentiable any number 
of times on their natural domains in R n . In other words, we require that the 
functions 


x 


l 






x n p = x^(xj,,...,x^) 


together give a C°° bijection where defined. The a and [3 are just indices from 
some set A and are just a notational convenience for naming the individual 
charts. 


Note that we are employing the same type of abbreviations and abuse of no- 
tation as is common is every course on calculus where we often write things like 
y = y{x). Namely, (x* , ...,x”) denotes both an n-tuple of coordinate functions 
and an element of K”. Also, Xp = xjg(x* , ..., x™) etc. could be thought of as an 
abbreviation for a functional relation which when evaluated at a point p on the 
manifold reads 

(xp(p), x n p{p)) = xp0 7L- l {x 1 a {p),...,xl{p)). 


A function / on M will be deemed to be C r if its representatives f a are 
all C r for every coordinate system x a = (x^,...,x”) whose domain intersects 
the domain of f. Now recall our example of temperature on a surface. For 
an arbitrary pair of coordinate systems x = (x 1 ,...^") and y = ( y 1 ,...,y n ) 
the functions f\ := / o x^ 1 and := / o y^ 1 represent the same function / 
with in the coordinate domains but the expressions and are not equal 
and do not refer to the same physical or geometric reality. The point is simply 
that because of our requirements on the smooth relatedness of our coordinate 
systems we know that on the overlap of the two coordinate systems if / o x^ 1 
has continuous partial derivatives up to order k then the same will be true of 

f ° y _1 - 

Also we have the following notations 
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C°°{U) or T{U) 

C™(U) or V{U) 

T P M 

TM, with tm '■ TM — > M 
T p f : T p M - T f[p) N 
Tf : TM -> 7W 
T*M 

T*M, with 7 Tm : T* M —> M 
J X (M, N) y 

X(U),X M (U) (or X(M)) 

(x, U ), (xq , t/ Q ), (V73, (7/3), (</?, / 7 ) 


Smooth functions on U 

with compact support in U 
Tangent space at p 
Tangent bundle of M 
Tangent map of / : M N at p 
Tangent map of / : M — > N 
Cotangent space at p 
Cotangent bundle of M 
fc-jets of maps f :: M,x —> N,y 
Vector held over U (or over M) 
Typical charts 
Tensors on V 
Tensor fields on M 


TJ (V) r-contravariant s-covariant 
T g(M) r-contravariant s-covariant 
d exterior derivative, differential 
V covariant derivative 


M, g Riemannian manifold with metric tensor g 

M, u> Symplectic manifold with symplectic form u> 

L(V, W) Linear maps from V to W. Assumed bounded if V,W are Banach 
Lg(V, W) r-contravariant, s-covariant multilinear maps V* r xV s — > W 

For example, let V be a vector space with a basis f 1; ...,f n and dual basis 
f 1 , ..., f” for V*. Then we can write an arbitrary element v £ V variously by 


v = v x ii I 1- v n f n — ^2 i 


= (fl n) 


/ V 1 


\ v r ‘ 


while a € V* would usually be written as one of the following 
a = aif 1 + • • • + a n f" = ^ a,f ! 


= (ai, a n ) 


We also sometimes use the convention that when an index is repeated once up 
and once down then a summation is implied. For example, a(v) = a,;U* means 
a(v) = Y J i a i v ' 1 - 

Another useful convention that we will use often is that when we have a list 
of objects (oi, On) then (oi, bj, ..., on) will mean the same list with the 
i-tli object omitted. 

Finally, in some situations a linear function A of a variable, say h, is written 
as Ah or 4 • ft instead of A(h). This notation is particularly useful when we 
have a family of linear maps depending on a point in some parameter space. 
For example, the derivative of a function / : R n — > at a point x £ R" is a 

linear map Df(x) : R" — > R m and as such we may apply it to a vector h £ R". 
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But to write Df{x){h) is a bit confusing and so we write Df{ x) ■ h or Df \ x h 
to clarify the different roles of the variables x and h. As another example, if 
x — > A{x) is an m x n matrix valued function we might write A x h for the matrix 
multiplication of A(x) and h £ R". 

In keeping with this we will later think of the space L(V, W) of linear maps 
from V to W as being identified with W(g> V* rather than V* (g> W (this notation 
will be explained in detail). Thus (w ® a)(u) = wa(v) = a(v)w. This works 
nicely since if w = (w 1 , ..., w m ) t is a column and a = (ai, a n ) then the linear 
transformation w (& a defined above has as matrix 

© 2000 Jeffrey Marc Lee 



452 


CHAPTER 26. 


APPENDICES 



Chapter 27 

Bibliography 


453 



454 


CHAPTER 27. 


BIBLIOGRAPHY 



Bibliography 


[A] 


[Arm] 

[At] 

[A,B,R] 


[Arn] 


[A] 


[Bott and Tu] 
[Bry] 

[Ben] 


[Bre] 


[Chavl] 

[Chav2] 

[Drin] 


J. F. Adams, Stable Homotopy and Generalized Homol- 
ogy, Univ. of Chicago Press, 1974. 

M. A. Armstrong, Basic Topology, Springer- Verlag, 1983. 

M. F. Atiyah, K-Theory, W. A. Benjamin, 1967. 

Abraham, R., Marsden, J.E., and Ratiu, T., Manifolds, 
tensor analysis, and applications, Addison Wesley, Read- 
ing, 1983. 

Arnold, V.I., Mathematical methods of classical mechan- 
ics, Graduate Texts in Math. 60 , Springer- Verlag, New 
York, 2nd edition (1989). 

Alekseev, A.Y., On Poisson actions of compact Lie 
groups on symplectic manifolds, J. Diff. Geom. 45 (1997), 
241-256. 


D. J. Benson, Representations and Cohomology, Volume 
II: Cohomology of Groups and Modules, Cambridge Univ. 
Press, 1992. 

[1] R. Bott and L. Tu, Differential Forms in Algebraic Topol- 
ogy, Springer- Verlag GTM 82,1982. 

G. Bredon, Topology and Geometry , Springer-Verlag 
GTM 139, 1993. 


Drinfel’d, V.G., On Poisson homogeneous spaces of 
Poisson-Lie groups , Theor. Math. Phys. 95 (1993), 524- 
525. 


455 



BIBLIOGRAPHY 


456 

[Dieu] 

[Do] 

[Dug] 

[Eil,St] 

[Fen] 

[Fr,Q] 

[Fult] 

[Gl] 

[G2] 

[Gu,Hu,We] 


[Gray] 

[Gre,Hrp] 


[Hilt2] 


[Huss] 


[Hu] 

[KM] 

[Kirb,Seib] 


J. Dieudonn'e, A History of Algebraic and Differential 
Topology 1900-1960, Birkh"auser,1989. 

A. Dold, Lectures on Algebraic Topology , Springer- 
Verlag, 1980. 

J. Dugundji, Topology , Allyn & Bacon, 1966. 

S. Eilenberg and N. Steenrod, Foundations of Algebraic 
Topology , Princeton Univ. Press, 1952. 

R. Fenn, Techniques of Geometric Topology, Cambridge 
Univ. Press, 1983. 

M. Freedman and F. Quinn, Topology of 4- Manifolds, 
Princeton Univ. Press, 1990. 

W. Fulton, Algebraic Topology: A First Course, 

Springer-Verlag, 1995. 

Guillemin, V., and Sternberg, S., Convexity properties of 
the moment mapping, Invent. Math. 67 (1982), 491-513. 

Guillemin, V., and Sternberg, S., Symplectic Techniques 
in Physics, Cambridge Univ. Press, Cambridge, 1984. 

Guruprasad, K., Huebschmann, J., Jeffrey, L., and We- 
instein, A., Group systems, groupoids, and moduli spaces 
of parabolic bundles, Duke Math. J. 89 (1997), 377-412. 

B. Gray, Homotopy Theory, Academic Press, 1975. 

M. Greenberg and J. Harper, Algebraic Topology: A First 
Course, Addison- Wesley, 1981. 

[2] P. J. Hilton, An Introduction to Homotopy Theory, Cam- 
bridge University Press, 1953. 

P. J. Hilton and U. Stammbach, A Course in Homological 
Algebra, Springer-Verlag, 1970. 

D. Husemoller, Fibre Bundles, McGraw-Hill, 1966 (later 
editions by Springer-Verlag). 

Huebschmann, J., Poisson cohomology and quantization, 
J. Reine Angew. Math. 408 (1990), 57-113. 


R. Kirby and L. Siebenmann, Foundational Essays on 
Topological Manifolds, Smoothings, and Triangulations, 
Ann. of Math. Studies 88, 1977. 



BIBLIOGRAPHY 


457 


[LI] 

Lang, S. Foundations of Differential Geometry , Springer- 
Verlag GTN vol 191 

[M,T,W] 

Misner,C. Wheeler, J. and Thome, K. Gravitation, Free- 
man 1974 

[Mil] 

Milnor, J., Morse Theory, Annals of Mathematics Stud- 
ies 51 , Princeton U. Press, Princeton, 1963. 

[MacL] 

S. MacLane, Categories for the Working Mathematician, 
Springer- Ver lag GTM 5, 1971. 

[Mass] 

W. Massey, Algebraic Topology: An Introduction, Har- 
court, Brace & World, 1967 (reprinted by Springer- 
Verlag) . 

[Mass2] 

W. Massey, A Basic Course in Algebraic Topology, 
Springer- Verlag, 1993. 

[Maun] 

C. R. F. Maunder, Algebraic Topology , Cambridge Univ. 
Press, 1980 (reprinted by Dover Publications). 

[Milnl] 

J. Milnor, Topology from the Differentiable Viewpoint, 
Univ. Press of Virginia, 1965. 

[Mil, St] 

J. Milnor and J. Stasheff, Characteristic Classes , Ann. of 
Math. Studies 76, 1974. 

[Roe] 

Roe,J. Elliptic Operators, Topology and Asymptotic 
methods, Longman, 1988 

[Spv] 

Spivak, M. A Comprehensive Introduction to Differential 
Geometry, (5 volumes) Publish or Perish Press, 1979. 

[St] 

Steenrod, N. Topology of fiber bundles, Princeton Univer- 
sity Press, 1951. 

[Va] 

Vaisman, I., Lectures on the Geometry of Poisson Mani- 
folds, Birkhauser, Basel, 1994. 

[Wei] 

Weinstein, A., Lectures on Symplectic Manifolds, Re- 
gional conference series in mathematics 29 , Amer. Math. 
Soc., Providence, 1977. 

[We2] 

Weinstein, A., The local structure of Poisson manifolds, 
J. Diff. Geom. 18 (1983), 523-557. 

[We3] 

Weinstein, A., Poisson structures and Lie algebras, 
Asterisque, hors serie (1985), 421-434. 

[We4] 

Weinstein, A., Groupoids: Unifying Internal and Exter- 
nal Symmetry, Notices of the AMS, July 1996. 



458 


BIBLIOGRAPHY 


[3] J. A. Alvarez Lopez, The basic component of the mean 
curvature of Riemannian foliations, Ann. Global Anal. 
Geom. 10(1992), 179-194. 

[BishCr] R. L. Bishop and R. J. Crittenden, Geometry of Mani- 

folds , New York:Academic Press, 1964. 

[Chavel] I. Chavel, Eigenvalues in Riemannian Geometry, Or- 

lando: Academic Press, 1984. 

[Cheeger] J. Cheeger, A lower bound for the smallest eigenvalue of 

the Laplacian, Problems in analysis (Papers dedicated to 
Salomon Bochner, 1969), pp. 195-199, Princeton, N. J.: 
Princeton Univ. Press, 1970. 

[ChEbin] J. Cheeger and D. Ebin, Comparison Theorems in Rie- 

mannian Geometry, Amsterdam: North-Holland, 1975. 

[Cheng] S.Y. Cheng, Eigenvalue Comparison Theorems and its 

Geometric Applications Math. Z. 143, 289-297, (1975). 

[El-KacimiHector] A. El Kacimi-Alaoui and G. Hector, Decomposition de 
Hodge basique pour un feuilletage Riemannien, Ann. Inst. 
Fourier, Grenoble 36(1986), no. 3 , 207-227. 

[Gray2] A. Gray Comparison Theorems for the Volumes of Tubes 

as Generalizations of the Weyl Tube Formula Topology 
21, no. 2, 201-228, (1982). 

[HeKa] E. Heintz and H. Karcher, A General Comparison Theo- 

rem with Applications to Volume estimates for Submani- 
folds Ann. scient. Ec. Norm Sup., 4 e serie t. 11, 451-470, 
(1978). 

[KamberTondeur] F. W. Kamber and Ph. Tondeur, De Rham-Hodge theory 
for Riemannian foliations, Math. Ann. 277(1987), 415- 
431. 

[Lee] J. Lee, Eigenvalue Comparison for Tubular Domains 

Proc.of the Amer. Math. Soc. 109 no. 3(1990). 

[Min-OoRuhTondeur] M. Min-Oo, E. A. Ruh, and Ph. Tondeur, Vanishing the- 
orems for the basic cohomology of Riemannian foliations, 
J. reine angew. Math. 415(1991), 167 174. 

[Molino] P. Molino, Riemannian foliations, Progress in Mathe- 

matics, BostomBirkhauser, 1988. 

[NishTondeurVanh] S. Nishikawa, M. Ramachandran, and Ph. Tondeur, The 
heat equation for Riemannian foliations, Trans. Amer. 
Math. Soc. 319(1990), 619-630. 



BIBLIOGRAPHY 


459 


[O’Neill] 

B. O’Neill, Semi-Riemannian Geometry, New 

York:Academic Press, 1983. 

[PaRi] 

E. Park and K. Richardson, The Basic Laplacian of a 
Riemannian foliation, Amer. J. Math. 118(1996), no. 6, 
pp. 1249-1275. 

[Ril] 

K. Richardson, The asymptotics of heat kernels on Rie- 
mannian foliations, to appear in Geom. Funct. Anal. 

[Ri2] 

K. Richardson, Traces of heat kernels on Riemannian 
foliations, preprint. 

[Tondeurl] 

Ph. Tondeur, Foliations on Riemannian manifolds, New 
York:Springer Verlag, 1988. 

[Tondeur2] 

Ph. Tondeur, Geometry of Foliations, Monographs in 
Mathematics, vol. 90, Basel: Birkhauser, 1997. 



